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Practical Design Rules for Optimum 
Finite Impulse Response Low- Pass 


Digital Filters 


By 0. HERRMANN,* L. R. RABINER, and D. 8S. K. CHAN 
(Manuscript received November 22, 1972) 


Although a great deal is known about design techniques for optimum 
(in a minimax error sense) finite impulse response (FIR) low-pass digital 
filters, there have not been established any practical design rules for such 
filters. Thus, a user is unable to easily decide on the (approximate or exact) 
filter order required to meet his design specifications and must resort to 
tables or trial and error procedures. In this paper, such a set of design 
rules is given. In the case of very narrow bandwidth or very wide bandwidth 
filters, analytic relations between the filter parameters can be readily ob- 
tained. In all other cases, exceedingly good linear and nonlinear fits to the 
data can be obtained over somewhat restricted ranges of the parameters. 
These fitting procedures lead to a practical set of simple design rules for 
estimating filter order from the desired specifications. 


I. INTRODUCTION 


The problem of designing an optimal (in the minimax sense) low- 
pass FIR digital filter to meet design specifications has been thoroughly 
investigated!~> and may be considered to be solved. Thus, given a 
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FREQUENCY RESPONSE 





NORMALIZED FREQUENCY 


Fig. 1—Definition of low-pass filter parameters. 


specified impulse response duration of N samples, a passband cutoff 
frequency F’, (see Fig. 1), a stopband cutoff frequency F, and a ratio 
of tolerances K = 61/62 (where 6, = passband tolerance and 62 = stop- 
band tolerance), an optimal approximation to these specifications can 
be designed. The approximation is optimal in the sense that, for given 
values of N, F,, F;, and K, 6; (or equivalently 62) is minimum. The 
nature of the solution is such that there are three distinct classes into 
which it may belong, depending on the specific design parameters. 
These classes have been called extraripple filters,** scaled extraripple 
filters,’ and equiripple filters with one less than the maximum possible 
number of ripples. The differences between these classes lie in the 
number and amplitude of the ripples in the weighted error curve. The 
weighted error curve is defined as: 


1 — H(e#*!) 
E(e®*!) = lee OS/SF> (1) 
— H(e2"!) F,< f<0.5, 


where H(e?*7/) is the frequency response of the optimal filter. Extra- 
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ripple filters have (N + 5)/2 equal magnitude extrema in their error 
curves. Scaled extraripple filters also have (N + 5)/2 extrema, all but 
one of which are of equal magnitude. The third class of solution has 
(N + 3)/2 equal amplitude extrema in its error curves. Figure 2 shows 
plots of curves of transition bandwidth, AF = F, — Fy, versus pass- 
band cutoff frequency, F',, for two sets of conditions.* The data in Fig. 
2a show the curves for K = 1, 6; = 6. = 0.1, N = 9, andN = 11; 
whereas Fig. 2b shows the curves for K = 100, 5; = 0.01 (62 = 0.0001), 
N = 19,and N = 21. The minima along each curve are the extraripple 
filters. (For fixed values of 6; and 62, as in this figure, there are only 
(N — 1)/2 distinct extraripple filters.) The local regions around the 
minima are the scaled extraripple filters’ and the remainder of the curve 
represents equiripple solutions with one less than the maximum number 
of ripples. As will be shown in the next section, the first and last extra- 
ripple filters can be obtained analytically because they are simply re- 
lated to the Chebyshev polynomial of appropriate degree.*° 

As seen from Fig. 2, as F', varies from 0 to its maximum possible 
value, the transition width goes through a sequence of minima and 
maxima. The variation in the transition widths of the minima and 
maxima decreases as N increases. Furthermore, the variation in transi- 
tion width between adjacent maxima and minima also decreases as N 
increases. In fact, except for a narrow region at the beginning and end 
of the curve, the curve of transition width versus passband cutoff fre- 
quency is relatively flat over a wide range of values of Fy, 61, and K. 
Figure 3 shows a sequence of three plots of transition width versus 
passband cutoff frequency for extraripple filters (i.e., only the minima of 
the curve are plotted) of length N = 101 for various values of 6; and 
K. (The entire curves are not plotted because the amount of computa- 
tion required for a smooth curve of such high order is inordinately 
high.) Figure 3a shows a sequence of four curves for K = 1, 6, = 0.1, 
0.01, 0.001, 0.0001, whereas Fig. 3b shows the same sequence for 
K = 10, and Fig. 3c shows the sequence for K = 100. Several observa- 
tions can be made from this figure. 


(¢) For a wide range of values of F,, and fixed 6; and 62, the transi- 
tion width of the extraripple filters is relatively insensitive to 
F,. The larger the value of K, or the smaller the value of 6, 
the worse this approximation becomes. 

(iz) In the regions of either very small or very large values of Fp, 
the transition width generally decreases. 


Based on these curves and on previous results with window designs,’° 
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Fig. 2—The transition width as a function of passband cutoff frequency for two 
sets of filter parameters. 
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Fig. 3—The transition width as a function of passband cutoff frequency for 
N = 101-point extraripple filters and K = 1, 10, and 100. 


it seems fairly reasonable to expect some simple design relationships 
to exist between the five basic filter parameters, N, Fp, F's, 61, and 62 
(or K), at least in the extreme case of the Chebyshev solution, and also 
for a reasonably large region near Ff, = 0.25. Experience in practical 
situations has shown that the number of terms needed in the optimum 
FIR low-pass filter to meet design specifications is significantly less 
than the number of terms estimated by known relationships on win- 
dows.!° Therefore, the goals of this paper are to obtain approximations 
to the actual design relationships between linear-phase, low-pass filter 
design parameters and to illustrate their use in actual design examples. 

The organization of this paper is as follows. First, the design rela- 
tionships for the Chebyshev solution are derived, and approximate 
formulas for the transition width in the limit of large values of N are 
obtained. Then the results of measurements on a wide range of filters 
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where the number of passband and stopband ripples are equal are 
shown. Then minimum mean-square relative error fits to the observed 
data (for large N) assuming both linear and nonlinear dependency on 
the basic filter parameters are computed. To apply the design relation- 
ships for all values of N, a correction formula is derived, based on con- 
sideration of the appropriate transition width of the filter. Finally, 
a set of rules is presented for going from a set of desired filter param- 
eters to an estimate of the lowest-order filter which meets these 
specifications. 


1.1 Summary of Design Relationships 


Given the low-pass filter parameters F',, F';, 61, and 62 (or, equiva- 
lently, K = 6,/62), the minimum filter impulse response duration, N 
required to meet the above specifications can be estimated from the 
relation 

_ Dao(51, 52) — f(K)(Fs — Fp)? 
mia (7 7 eae 
where 


D.(61, 52) = [5.3809 xX 10-3 (logio 51)? + 7.114 X 107? (logio 61) 
_ 0.4761 | logo 62 — [2.66 xX 1073 (logic 63)? 
+ 0.5941 (logio 61) + 0.4278 ] 
and 
f(K) = 0.51244 logis K + 11.01217. 


The above relations are valid to within 1.3 percent relative error in N 
if 64 < 0.1 and 62 S$ 0.1. 


II. CHEBYSHEV SOLUTIONS 


Let {h(n), n = —(N — 1)/2, ---, (N — 1)/2} be the impulse re- 
sponse of the desired digital filter. (NV is assumed to be odd throughout 
this paper.) The impulse response satisfies the symmetry condition 
h(n) = h(—n) to give the desired linear phase. The frequency response 
of the filter is given by 


(N=1)/2 
H(e#7f) = h(O) + om 2h(n) cos (27 fn). (2) 
By making the substitution 





Xo— 1 
cos (2rf) = is (3a) 


OF) 
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ge (22+) cos ann) + (=) (3b) 


the interval 0 S$ f S 0.5 is mapped to the interval Xo = x = —1. It 


is easily shown?” that the mapping transforms the trigonometric poly- 
nomial of eq. (1) to an algebraic polynomial in x of the form 


or 





(N—-1)/2 
Ga) = YL b(n)x, (4) 


where the {b(n)} are straightforwardly related to the {h(n)}. 

The basic filter design problem is to find coefficients b(n) [or h(n) ] 
such that the weighted error of approximation is equiripple in both the 
passband and stopband. In the case when either the passband or the 
stopband has only one ripple, the solution to the filter design problem 
may be obtained analytically based on the theory of Chebyshev poly- 
nomials. In all other cases, alternative techniques must be used to ar- 
rive at the appropriate solution. 

Consider the Mth degree Chebyshev polynomial, 7 v(x), as shown in 
Fig. 4 for M = 4. The standard representation for T'y() is 


T u(x) = cos [M cos™!z], at SA (5) 
T u(x) = cosh [M cosh" x ], ess (6) 


In the interval —1 S$ x $1, T(x) oscillates between +1 and, be- 
yond the value x = +1, T(x) grows approximately as x™. If we define 
Xo (see Fig. 4) as the point where 7 u(Xo0) = (1 + 61)/62, and X, as 
the point where Ty(X p) = (1 — 61)/6:, it is readily seen that 62: T u(x) 
is a polynomial of the form of eq. (4) [with M = (N — 1)/2] which 
is equiripple in both the passband X, S x S +Xo, and the stopband 
—1<42«S +1, and hence is an optimal solution to the filter design 
problem. Of course, this solution is a special case of the general solution 
in that there is only one ripple in the passband, but at least it is an 
analytically tractable case from which a great deal can be learned about 
the relationships between the filter parameters. 

It is straightforward to solve for the points X, and Xo in terms of 
61, do, and N. If we set M = (N — 1)/2, then at x = Xo > 1.0, we get 
the relation 


and 


1 6 
T u(X0) = a 


= cosh [M cosh! Xo ] (7) 


or 


1 
Xo = cosh | 7 cosh (*F-*) |. (8) 
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145, 
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CHEBYSHEV POLYNOMIAL 
M=4 


1-3, 
52 





Fig. 4—A fourth-degree Chebyshev polynomial defining the points Xo and X>. 


At x = X, > 1.0, we get the relation 


1 — 6; 


T u(X p) _ bo 


= cosh [M cosh X, | (9) 





or 
1 4 f Sr |. 
Xp» = cosh lz cosh (> ) | (10) 


The inverse mapping of eq. (3) can be used to determine the filter 
cutoff frequencies by the relation 


= Eewsa =| 


1 
f _ on cos 
Thus, F’, (corresponding to X,) and F, (corresponding to « = +1.0) 
can be readily obtained using eq. (11) with the appropriate values for 
x. In this case, the transition width (F, — F,) can be analytically de- 
termined for all values of N, 61, and 62 as: 


a 





20 1+ Xo 


Sie (aa )I: (12) 


AF = F,— F,= Ba | eos ( 
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In the limit of large values of N (or, equivalently, 17), eq. (12) can be 
simplified by the following approximations. First, for sufficiently 
small values of a, 


2 
cosha 1+ 5, aX. (13) 


Thus, eqs. (10) and (8) can be simplified to the form [replacing M by 
(N — 1)/2] 





viele ay 
are Gra) ae). bis 


The approximation is then made that for sufficiently small e, 
cos! (1 — «) = V2e. (16) 


Thus, F, and F, are well approximated as 
“a 2 {1+ & 
Fe ~ 5- | yr cosh (*)| (17) 


obvi C$) 
= | cosh! (=): (18) 


Thus, for large N (N > 1) the approximation to the transition width 
curve is given by 


oe 7 1 Sig) ole ay 
AF = F,—F, = = 6 ee | cosh (4% ) 


Z | [cosh (+E) [P [cosh (# =i (19) 


(This approximation is valid to within 1 percent for N = 51.) Equation 
(19) shows AF to be inversely proportional to (NV — 1). This identical 
inverse behavior has been noted previously for filters designed by 
windowing.!® These and other considerations lead one to consider as 
a performance measure of a low-pass digital filter the quantity D de- 
fined as 





D=(N= DAF SW =D, =F (20) 


which, in many cases, depends only on 6; and 4:. 
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The curves in Fig. 5 (for N = 127) show plots of performance D 
versus 20 logio (62) both for values of K = 6,/52 from K =1 to 
K = 1000 and for values of 6; from 0.5 to 0.0001. The behavior of these 
curves is as predicted from eq. (19) and from intuition about the be- 
havior of D as 6; and 6, get large. It is clear that when 6, + 6, = 1.0 
[ie., 6, = K/(K + 1), & = 1/(K + 1)], then D = Osince there is no 
transition band. In this case, the term cosh! (1 — 61)/62 vanishes, and 
the first and second terms in eq. (19) cancel exactly. In the limit of 
small values of 61, the second and third terms in eq. (19) are approxi- 
mately equal and cancel. Thus, D is approximately of the form 


D = 2 cosh7! (+=*). (21) 
vin 62 
Since 
cosh («) = In (w + va? — 1), (22) 


eq. (21) can be rewritten as (assuming 6, negligible) 
~ + (In2 — In és) (23) 


which is independent of 6,. Thus, as seen in Fig. 5b, in the case of small 
61, D is essentially independent of 6). 

The curves of Fig. 6 show the behavior of D for various values of 
N S 127, for K = 1, 10, and 100. The values of N used for these plots 
were N = 3, 7, 11, 19, 51, and 127. The differences between the data 
for N = 127 and the data for N = 51 are relatively small. These 
curves also exhibit another interesting phenomenon. The curve for 
N = 8 saturates at a value of D = 1. This is due to the limitation that 
the transition width, AF, must be less than or equal to 0.5. Thus, the 
saturation value of D is (N — 1)/2, or 1.0 for N = 3. The larger the 
value of K, the larger the value of 62 beyond which the curve for N = 3 
saturates. 

In the case of the Chebyshev solution to the optimal filter design 
problem, a formula can be derived for the impulse response duration 
N of a filter whose response meets specified values of 6;, 52, and F,. 
Since F,, is not specified, this result is useful only as a first guess of 
a value of N which meets specifications on all four filter parameters. 
From the discussion given earlier in this section, f = F, when z = 1. 0: 
Thus, eq. (11) can be used to solve for Xo as 


3 — cos (27F.) 


Xo = 1 + cos (27F,) 


(24) 
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CHEBYSHEV SOLUTIONS 
N= 127 
K VARYING 


PERFORMANCE D 


6, VARYING 
5, = 0.5, 0.2, 0.1, 0.05, . . ., 0.0005 





20 LOG ig (55 ) 


Fig. 5—Plots of D versus logio 52 as a function of K and 6; for the N = 127 Cheby- 
shev solutions. 
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Fig. 6—Plots of D versus logio 52 as a function of N for K = 1, 10, and 100, for 
the Chebyshev solutions. 
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Equation (8) may now be used to solve for N as 


2 cosh7! (=*) 
Ne ee ee (25) 


_, | 3 — cos (Q7F,) 
1 
cosh E + cos (QzF.) 


Using trigonometric identities, eq. (25) can be simplified to the form 


ecosh7! (Gi ) 
de 


1 
-1 
cot (Sater) 


For these values of X) and N, F, may be obtained from eqs. (10) and 
(11). If the transition width obtained is too large (too small), N is then 
decreased (increased) until the desired specifications are approximately 
achieved. (Since there is only one Chebyshev solution for fixed 6, 5:, 
and N, exact values of both F, and AF cannot usually be obtained.) 
As will be seen later, eq. (26) forms the basis for estimating a lower 
bound on the filter order required to meet given design specifications 
on AF, 61, and 6s. 

As mentioned earlier, Chebyshev polynomials can be used as the 
optimal solution in the case when there is one ripple in either the pass- 
band or the stopband. To see how this second case can be handled, con- 
sider the Chebyshev polynomial T(x) where T'(Xo) = (1 + &)/d 
and M = (N — 1)/2. As shown earlier, this polynomial represents an 
optimal filter with passband ripple &, stopband ripple 6;, passband 
cutoff frequency F',, and stopband cutoff frequency F,. Consider the 
function RF y(xz) defined as 


(26) 


Ru(x) =1—- iT u(—2x + Xo i 1), (27) 


where —1 S$ x S Xo. An examination of the properties of Ras(x) 
shows 


(t) Ru(x) is a polynomial in x of degree M. 
(22) In the interval Xo — 2 S$ x S Xo, Rxu(x) oscillates between the 
values 1 — 6,, and 1+ 4. 
(zit) In the interval -—1 S 2S Xo —1—X,, Ru(x) goes from 
—§, to bo. 


If we define F, and F, as the equivalent filter cutoff frequencies, then 
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it is readily shown that 


fF, =0.5 —F, 


28 
BF, = 0.5 — Fp. ie) 


Thus R u(x) is a polynomial with only one stopband ripple which satis- 
fies the filter optimality criterion. In summary, to design Chebyshev 
approximations with only one stopband ripple, one merely makes the 
substitution 


61 = b, 
bo = 5 
2 1 . (29) 
F,=0.5 — F, 
F,=0.5 — F, 


and solves for an equivalent filter with one passband ripple using the 
formulas of this section. 

The data on the Chebyshev solutions provide valuable insights into 
the behavior of the design relationships between filter parameters in 
more general cases. These data and their design relationships are pre- 
sented in the next sections. 


III. MEASUREMENTS OF D 


Earlier, it was shown that the transition width or, equivalently, the 
performance measure D for the Chebyshev solutions (i.e., either small 
F, or large F,) was generally significantly smaller than for most of the 
range of values of passband cutoff frequency. To obtain data on a more 
realistic set of filters, the value of D was measured for a large number of 
extraripple filters designed with the constraint that the number of pass- 
band and stopband ripples were the same. In this manner, the passband 
and stopband widths were almost equal and, thus, the measured data 
would characterize the parameter set over as wide a range of values of 
61, 62, and F’, as possible. Six different values of N were used including 
N = 8, 7, 11, 19, 51, and 127. Over 1500 filters were designed to 
cover the parameter range 0.00001 S$ 6. $ 0.5, and 1 S$ K S 500 
(K = 61/62). Figures 7a through 7i show plots of D versus 20 logio (62) 
for the nine values of K and the chosen values of N. All these data are 
presented since they are fairly general and may be useful in a wide 
variety of contexts other than this paper. (Also, their measurement re- 
quired almost 3 hours of computer time on a fairly fast processor.) 

It is remarkable how similar the plots of D versus log 62 for the more 
general case of Fig. 7 are to the identical plots for the Chebyshev solu- 
tions. Similar behavior for small D is expected, since D tends to 0 as 
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6; + 62 tends to 1.0 independent of N, Fy, and F,. However, the ap- 
proximately linear behavior of D as a function of log 62 and log K (for 
large NV) is unexpected. Another similarity between the two cases is the 
independence of D of (N — 1) for large N. The tendency of D to 
saturate for small values of N is yet another similarity between the 
curves. The main difference between the sets of curves is that, in the 
Chebyshev case, D is approximately independent of 6, for small 61. 
This behavior is not observed for the extraripple filters of Fig. 7. 

A summary of the behavior of D for N = 127, as a function of log 6. 
for various values of either K or 6, is presented in Fig. 8. The values of 
6, used were 0.5, 0.2, 0.1, --- , 0.00002, whereas K ranged from 1 to 
500 as in Figs. 7a through 71. In some sense, this figure represents a set 
of design curves for high-degree low-pass filters. In the next section, 
we show how the data of Fig. 8 can be approximated by linear and non- 
linear fits, and how simple modifications can be made to correct the 
results for values of N less than 127. 


IV. DATA-FITTING PROCEDURES 


In order to make most efficient use of the data of the previous sec- 
tion in a practical design problem, it is useful to express the relation- 
ships between the filter parameters in a simple manner. Since we know 
of no way of deriving exact analytical formulas, as in the Chebyshev 
case, a minimum mean-square relative error fit to the data over a re- 
stricted but reasonable range was sought. Both a linear and a nonlinear 
fit to the data of Fig. 8 (N large) were obtained. Corrections for smaller 
values of N were then obtained giving a complete set of design rules. 

The data of Fig. 8 suggest that except in the region D = 0 (large 
values of 61) a simple linear fit can be obtained. The curves of Fig. 8 
were assumed to be of the form 


Diutn = @ + b login (62) + ¢ logio K, 
a = —0.803 
; 30 
b = —1.359, ey) 
ec = —0.737. 


where Dy wn is the predicted value of D. The values for a, b, and c were 
chosen to minimize the sum of the squares of the relative differences 
between Drury and D for the N = 127 data for values of 6; in the range 
0.01 = 61; 0.01 = 62. The reason relative rather than absolute errors 
were considered is that a fixed percentage error in D, 5D, approximately 
gives a fixed percentage error in N, 6N, when transition width is held 
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20 LOG y (dq ) 


Fig. 7—Plots of D versus logio 52 as a function of N for K = 1, 2, 5, 10, 20, 50, 100, 
200, 500 for the case of extraripple filters with the same number of passband and 
stopband ripples. 
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Fig. 7 (continued). 
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= 127 data of Fig. 7 showing curves for 


Fig. 8—Plot of D versus logio 52 for the N 


various values of K and 6. 
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— 1) 
—1 


sD _ 
D 


Figure 9 shows a plot of the relative error of the predicted values of 


Except for 
less than 1.0 


0.00002. 
percent for the entire range of 6, and 6; considered. Based on a value of 


0.005, 0.002, ---, 


) 


for 6; = 0.01 


a small region on the curve 61 


D versus log 52 


the relative error is 


? 


0.01 


equivalent to an error 


= 127, a relative error of 1.0 percent in D is 
of 1.26 samples in N, or approximately one-half a filter order off from 


the correct order. Errors of this magnitude are generally considered to 


be quite small, i.e., the prediction is reasonably good. 


N 


In an effort to improve the fit and extend the range of applicability 


of the approximation, a nonlinear formula was chosen for D. Based on 


the data of Fig. 8, it was observed that the slope of the curve of D 


The simplest approxi- 


mation was to try a fit which was linear with respect to log 62. and 


versus logio 62 changes nonlinearly with log 63. 
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quadratic in log 6;. Such a fit is of the form 


Dy = [ar(logio 61)? + ae logio 61 + a3 ] logio de 
+ [as(logio 61)? + as login 61 + a6]. (32) 
The constants a; to as were chosen to minimize the mean-square rela- 


tive error for the N = 127 data over the range 0.1 = 6; 2 0.000001, 
0.1 = 62 = 0.000001, and turned out to be 


a, = 5.309 X 1073 
a2 = 7.114 K 107 
a3 = —4.761 X 107} 
—2.660 X 107° 
—5.941 X 107? 
ag = —4.278 X 107}. 


a 
> 
I 


Q 
cr 
I 


Figure 10 shows the relative error of the predicted value of D as a func- 
tion of log 6 for values of 6; from 0.1 to 0.00002. The peak percentage 
error is 1.3 percent, and over most of the range the percentage error is 


OPTIMUM LINEAR FIT 
N=127 DATA, Ny =N, 


RELATIVE ERROR OF PREDICTED D IN PERCENT 





100 -90 80 -70 ~60 50 —40 
20 LOG 9 (85) 


Fig. 9—The relative errors in fitting the data of Fig. 8 with a linear curve over the 
range 6, S$ 0.01, for various values of 6; (NV = 127). 
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N = 127 DATA 
OPTIMUM NONLINEAR FIT 


RELATIVE ERROR OF PREDICTED D IN PERCENT 





—100 -90 80 ~70 60 —50 40 -30 ~20 
20 LOG 19 (59) 


Fig. 10—The relative errors in fitting the data of Fig. 8 with a nonlinear curve over 
the range 6, S 0.1, for various values of 6; (NV = 127). 


much smaller. Clearly, this prediction formula is acceptable for almost 
any design application. 

Figure 11 shows a summary of the predicted values of D as a func- 
tion of stopband attenuation for a wide range of values of passband 
ripple. In this case, passband ripple in dB is defined as 








Passband ripple = 20 logio (=) ; (33) 
Pp 
Stopband attenuation = —20 logio (4s), (34) 
where 
261 
and 
oe (36) 
* 14+ 6 


These data correspond to standard design data for continuous-time 
filters where the frequency response magnitude is constrained to be less 
than or equal to 1.0. 
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PERFORMANCE D 





STOPBAND ATTENUATION IN dB 


Fig. 11—Plots of D versus 20logioé2 for various values of the parameter 
20 logio (1/1 — 6,), where 6, = 25;/(1 + 61), as calculated from the optimum non- 
linear fits to the data of Fig. 8 (NV = 127). 


V. CORRECTIONS FOR SMALL VALUES OF NV 


The formulas in the previous section are accurate for predicting D 
(or, equivalently, NV) for values of N greater than about 51. As seen from 
the curves in Fig. 7, as N decreases, D decreases for fixed values of 52 
and K. It is also seen from Fig. 7 that the differences increase with 
decreasing 62 or, equivalently, increasing transition width. An examina- 
tion of the relative deviation of Dy, the predicted value of D, from its 
true value as a function of transition bandwidth showed that, inde- 
pendent of N, the deviations could be simply approximated by a curve 
of the form 


Door = f(K)(AF)?, (37) 
where Dor is the correction term and f(K) is of the form 
f(K) = (0.51244 logio K + 11.01217). (38) 


(The constants in eq. (38) were again obtained by a minimum mean- 
square relative error data-fitting procedure.) Thus, using eqs. (37) and 
(38), a formula for D which depends on N, K, and 62 can be obtained. 
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Adopting the notation 


(2) D. (52, K) = Dyt of eq. (32)* 
(2) D(S2, K, N) = predicted value of D as a function of N as well 
as 62 and K 


(zit) D(Se, K, N) = true value of D, 
we obtain the relation 
Deo(52, K) — D(52, K,N) = f(K)(AF)?. (39) 


Thus, in a design case where 62, K (or 6;), Fp, and F, (i.e., AF’) are 
specified and the problem is to estimate N, the impulse response dura- 
tion required to meet these specifications, eq. (89) can be used directly 


since : 
D(62, K, N) = (N — 1)AF. (40) 


Thus, combining eqs. (37) and (88) and solving for N gives 


_ Deli, K) — fKYAR)? 5 


- AF 


(41) 

A closed-form expression for D(é., K, N) may be obtained by sub- 
stituting eq. (40) for D(62, K, N) in eq. (39) and solving the quadratic 
equation for AF. Equation (40) is then used to give D(52, K, N). Thus, 


we get 
f(K)(AF)?2 + (N — 1)AF — D.(b2, K) = 0 (42) 


pa N= 0( [py TODA RE _ 
ar = Sag (Vit age ot) 
Bees, KW) = Saag ( 1 1): (44) 


In the limit, as N tends to infinity, eq. (44) shows D(é2, K, N) tends to 
D..(62, K) as expected. 

Using eq. (44), the relative error of the predicted value of D from the 
true value was measured for the data for N = 3, 7, 11, 19, 51, and 127 
with 6, S 0.1. The relative errors for values of K from 1 to 500 are 
plotted in Figs. 12a through 12f. In all cases, the worst relative error in 
D is sufficiently small that the equivalent error in N is less than one 
sample. Thus, for all practical purposes, the design equations above 
serve as a useful guide for estimating the order of the filter required to 
meet design specifications. 


* Eq. (32) gives Dnx as a function of 6; and 52 but since 6; = Ké:, it is also im- 
plicitly a function of 52 and K. 
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VI. SUMMARY OF DESIGN PROCEDURES AND EXAMPLES 


Based on the results presented in this paper, it is now possible to give 
a set of rules for estimating the filter impulse response duration re- 
quired to meet given design specifications. These rules are as follows: 


(2) 
(it) 


(722) 


(zw) 


Check if either 6, or 62 is greater than 0.1, in which case the 
graphical data of Fig. 7 are used directly to estimate N. 
Calculate a value of N, call this N,, corresponding to the extra- 
ripple case where N, = N;, from the equation 


_ Da(61, 62) — f(K)(AF)? 
Ni = AF nae 





where D,,(6, 62) is the optimum nonlinear fit to the data for 
large N and is given by: 


D.(61, 62) = [5.309 x 1078 [logio 6; |? + 7.114 x 10-2 logo 61 
= 0.4761 | logio 62 — [2.66 xX 1073 (logio 61)? 
+ 0.5941 logio 61 + 0.4278 ] 
and f(K) is given by 


f(K) = 0.51244 logy (K) + 11.01217. 


If the desired value of F’, is less than or equal to 0.04 (let us 
call this case 1), or if the desired value of F’, is greater than or 
equal to 0.46 (case 2), then the estimate of N is obtained in 
rule (zv). Otherwise, the value Ni of rule (22) is used as the 
estimate of N. 

To obtain the value of N,. for the Chebyshev solution for case 
1, eq. (26) is used to get a first approximation to N,. N, is then 
systematically varied until a Chebyshev solution is obtained 
which meets specifications on 6;, 62, and AF. (As discussed 
earlier, it is not generally possible to find a Chebyshev solution 
which meets specifications exactly on all four filter parameters.) 
For case 2, 6; and 62 are interchanged, and F;, is replaced by 
0.5 — F,,, in order to solve for N.. from eq. (26). In cases where 
this rule is applied, the value N.. obtained is a lower bound to 
the true value of N;. No upper bound may be given in this case. 
A discussion of this problem is given below. 


Several comments are necessary about these rules before proceeding 
to some examples. The discussion in this paper has concentrated on two 
regions of the curve of AF versus F',— the Chebyshev solutions and the 
case of extraripple filters with an equal number of passband and stop- 
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RELATIVE ERROR OF PREDICTED D IN PERCENT 





20 LOG 49(55) 


Fig. 12—Plots of the relative errors in fitting the data of Fig. 7 using the corrected 
values of D for N = 3, 7, 11, 19, 51, and 127. 


793 


OPTIMUM FIR LOW-PASS DIGITAL FILTERS 





a 
N o 9 
pans 


4AN3045d NIG G3LOIGSYd JO YOuYS SAILV1SY 





—40 —30 -20 —10 


50 


—60 


—70 


—80 


—90 


—100 


20 LOG 19(5) 


Fig. 12 (continued). 
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band ripples. The justification for such emphasis was the results shown 
in Figs. 2 and 3 which indicated that the extraripple solution for 
N, = N, was fairly representative of a large region of the curve of 
AF versus F',, except in the case where F', was either very small or very 
large, in which case the Chebyshev solution became important. Also, 
as seen from Fig. 2, between extraripple solutions, the curve of AF 
versus F', peaks up. However, it is seen that in many cases one can 
“approximately” bound the maximum between extraripple solutions 
by the next lower-degree extraripple solution. Since the design equa- 
tions can predict AF for this case (the next lower-degree extraripple 
solution), a good bound on N can be obtained for a reasonably large 
region of the curve of AF versus Fy. In these cases, the value given by 
rule (22) is good to within +4 in the worst cases, i.e., large values of 
K, and generally to within +2. 


TRANSITION WIDTH (AF) 





TRANSITION WIDTH (AF) 


PASSBAND CUTOFF FREQUENCY (F.,) 


Fig. 13—Explanation of the types of behavior of curves of transition width versus 
passband cutoff frequency. 
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In the region of the Chebyshev solutions, however, such bounding 
procedures no longer are valid. An explanation of the difficulties which 
may be encountered is given in Fig. 13, which shows two types of curves 
of AF versus F,. Fig. 18a shows the case, discussed above, where the 
extraripple solutions of impulse response duration N — 2 are approxi- 
mately midway between extraripple solutions of impulse response dura- 
tion NV. In this case, the next degree solution bounds the maximum AF 
between extraripple solutions. Figure 13b shows the case where the 
extraripple solutions of impulse response duration N — 2 have ap- 
proximately the same values for AF and F,, as the extraripple solutions 
of impulse response duration N. In these cases, there is no good bound 
on the maximum value of AF between adjacent extraripple solutions. 
The case of Fig. 13b corresponds to regions of F', near 0.0 and F, near 
0.5, i.e., in the regions of the Chebyshev solutions. In these cases, as 
discussed in rule (zv), there is only an underbound on JN, and no 
overbound. 

The choice of a value of 0.04 in rule (777) as the width of the region 
during which the behavior of N can only be underbounded was ob- 
tained from the data of Fig. 3 which shows that, beyond this region, the 
variation in the values of AF for extraripple solutions is small. 

Figures 14 through 16 illustrate typical behavior of the curve of 
minimum value of impulse response duration N, to meet given specifi- 
cation on 61, 62, and AF as a function of Fy. Figure 14 shows data for 
the case 6; = 0.01, 62 = 0.0001, AF = 0.158. The value of N; from 


27 


6, = 0.01 


nN 
a 


6 = 0.0001 


ND 
Ww 


AF = 0.158 


FILTER LENGTH IN SAMPLES (N) 
nN 





0 0.04 0.08 0.12 0.16 0.20 0.24 0.28 0.32 0.36 
PASSBAND CUTOFF FREQUENCY (Fy) 


Fig. 14—Optimum values of N to meet given design specifications on 61, 52, and 
AF, as a function of Fp. 
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5, = 0.01 


6, = 0.0001 


AF = 0.032 


FILTER LENGTH IN SAMPLES (N) 





0 0.1 0.2 0.3 0.4 
PASSBAND CUTOFF FREQUENCY (FA) 


Fig. 15—Optimum values of NV to meet given design specifications on 51, 52, and 
AF, as a function of F>. 


rule (iz) is 21, and the values of N, are 19 (for small F,) and 13 (for 
large F,). Over the region 0.06 S$ F, S 0.32, N is within 2 of the nomi- 
nal value of N, = 21. Over the entire range of F,, N is within 4 of the 
value NV; = 21. The data of Fig. 14 correspond to the case of Fig. 13a. 

Figure 15 shows data for the case 6, = 0.01, 62 = 0.0001, AF = 0.032. 
The value of N,; from rule (zz) is 101, and the values of N,. are 95 (for 
small F,) and 55 (for large F,). Over the region 0.1 S$ F, S 0.38, N is 
within 2 of the nominal value of N; = 101. However, in the region 
0 Ss F, S 0.036, the value of N fluctuates from a minimum of 95 (the 
Chebyshev lower bound) to a maximum of 117. The explanation of this 
erratic behavior of N is seen in Fig. 16. Figure 16a shows the data of 
Fig. 15 on an expanded horizontal scale, and Fig. 16b shows a plot of 
the approximate curves of AF versus F,, for all values of N from 95 to 
117. The heavily traced parts of these curves show the lowest-order 
solution which just meets specifications on AF. From Fig. 16b, it is 
clear that in the vicinity of the first few extraripple solutions, the curves 
of AF versus F', are exceedingly steep. Hence, a slight change in F, 
greatly increases the required order solution to meet specifications. In 
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5, =0.01 
5, = 0.0001 
AF = 0.032 


FILTER LENGTH !N SAMPLES (N) 





0.040 


0.038 


TRANSITION WIDTH (AF) 
fo) 
2 
@ 
NR 
a 





0 0.004 0.008 0.012 0.016 0.020 0.024 0.028 0.032 0.036 
PASSBAND CUTOFF FREQUENCY (F,) 


Fig. 16—Explanation of the behavior of the data of Fig. 15 in the region 0 S Fp 
S 0.036. 


these cases, it is impossible to estimate the exact filter impulse response 
duration which is required. Instead, only a lower bound can be given. 
Fortunately, as seen in Fig. 15, the regions in which this erratic be- 
havior can occur are limited. 

We conclude this section with a set of examples which illustrate the 
use of the design rules. 
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Example 1: Find the minimum value of N required to meet the specifi- 
cations 6; = 0.05, 62 = 0.0001, F, = 0.19, F, = 0.21. 


From rule (77) we get N; = 129.7 which is rounded to 129. (Herein, 
all values of N will be rounded to the nearest odd integer.) Since F, 
is far from the range for the Chebyshev solutions, the value of 129 is 
used as the appropriate estimate. The actual filter impulse response 
duration required is N = 1381, although the N = 129 filter just missed 
meeting specifications. 


Example 2: Find the minimum value of N required to meet the specifi- 
Ications 6; = 0.01, 6. = 0.0001, F, = 0.213, F, = 0.373. 


From rule (77) we get N; = 19. Since F, is again out of the range of 
the Chebyshev solutions, the value 19 is used as the estimate of N. 
The actual filter impulse response duration required is N = 19. 


Example 8: Find the minimum value of N required to meet the specifi- 
cations 6, = 0.1, 62 = 0.1, Fp = 0.12, F, = 0.19. 


From rule (77) we get Ni = 11. Since F,, is out of the range of the 
Chebyshev solutions, the value 11 is used as the estimate of N. The 
actual value of N is 11. In this case, it is interesting to note that the 
value of N, for the Chebyshev solution is also 11. This example points 
out that, for filter specifications leading to small values of N, there is 
very little variation in the actual value of N as F’, varies. This observa- 
tion has been made earlier with respect to Fig. 14. 


Example 4: Find the minimum value of N required to meet the speci- 
fications 6; = 0.01, 62 = 0.0001, F, = 0.36, F, = 0.497. 


From rule (zz) we get Ni = 238. Since F, is within the bounds of rule 
(17z) (case 2), N. is computed from rule (zv) as 13. The actual value of 
N is 19. In this case, the lower bound is within three filter orders of the 
true solution. 


VII. SUMMARY 


This paper has presented a wide variety of data on the relationships 
between design parameters for optimal low-pass FIR linear-phase digi- 
tal filters. Analytical formulas were derived for the Chebyshev solu- 
tions, i.e., when there was only one passband or stopband ripple. Ap- 
proximate fits to the data using nonlinear relationships between AF, 
N, 63, and 62 were given in the case where the number of passband and 
stopband ripples was equal, and it was argued that these relationships 
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were valid over a wide range of values of the filter parameters. Finally, 
a simple set of rules for estimating the minimum value of N, which 
meets given specifications on F », F’,, 61, and 62, was discussed. Examples 
were given to illustrate the application of the rules. 
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Impulse Response of Clad Optical 
Multimode Fibers 


By D. GLOGE 
(Manuscript received January 9, 1973) 


Loss, coupling, and delay differences among the modes of multimode 
fibers influence thetr response to intensity-modulated optical signals. This 
“baseband” response is derived here from a time-dependent continuous 
description of the power flow in the fiber. Particular attention is given to 
the output as a function of angle and to the impulse response, its width 
and symmetry. We find that coupling narrows the impulse response 
but, at the same time, causes additional loss. Under practical condi- 
tions, this loss may limit the usefulness of coupling for the purpose of 
reducing the mode dispersion. We calculate a possible data rate of 12 
Mb/s for a 10-km repeater spacing and an effective numerical aperture 
of 0.1, but we show that further «mprovements can be gained from an 
optimization of the coupling characteristic and of other parameters. 


I. INTRODUCTION 


Although single-mode operation of clad optical fibers is possible and, 
in general, offers very good transmission characteristics, multimode 
fibers have two advantages: They impose less stringent requirements 
on the optical carrier (they transmit even the incoherent light from 
a luminescent diode) and their larger dimensions alleviate splicing 
problems or at least relax the tolerances required for connection. 
Typically, the core diameter is of the order of a hundred wavelengths 
and the fiber therefore transmits thousands of modes, even if the index 
difference between core and cladding is only a few percent (corre- 
sponding to a numerical aperture of 0.2 to 0.3). 

The usefulness of such fibers depends on their dispersion characteris- 
tics. Delay differences among the many modes? distort the signal and 
certainly produce a signal response inferior to that of the single-mode 
fiber. For certain systems, on the other hand, overall system economy 
may place the desirable information rate of individual fiber channels 

801 


802 THE BELL SYSTEM TECHNICAL JOURNAL, JULY-AUGUST 1973 


in a range where the characteristic signal response of multimode fibers 
is adequate (below 100 megabits per second, say). In such systems, 
multimode operation would surpass single-mode operation because of 
the advantages mentioned earlier. 

Effective use of the multimode fiber would presuppose the excitation 
of a large number of modes right at the input with the objective of 
transmitting all of these modes to the receiver. Experiments that have 
approximated these conditions have revealed a rather intricate re- 
sponse to short input pulses both in liquid- and solid-core multimode 
fibers.2:3 For example, the width of the output pulse did not increase 
linearly with fiber length, but showed a less-than-proportional increase 
for long fibers. Coupling among the modes and a dependence of loss on 
mode number seemed to play a part.* In some fibers, this resulted in an 
optimal mode distribution (causing lowest overall loss) which com- 
prised only a fraction of the modes capable of propagating. Measure- 
ments of the coupling strength showed that a total exchange of power 
between two modes was likely to occur within less than a meter of 
fiber.’ This result made it clear that a perturbation theory depending 
on small coupling rates was not applicable. A closed and unrestricted 
description was achieved by assuming a modal continuum rather than 
thousands of individual modes. In this theory, mode coupling took the 
form of a diffusion process not limited to small coupling amplitudes. 

The work discussed here extends the approach outlined in Ref. 5 by 
taking the velocity differences among the modes into account. We first 
consider fibers in which the optimal (steady-state) mode distribution 
does not include modes close to cutoff. The power in the fiber is calcu- 
lated as a function of time and output angle (mode number) for the 
case of a short input pulse. Particular attention is given to the “fiber 
impulse response”’ obtained by integrating over all angles at the out- 
put. A simple formula relates the output pulse width to the fiber length 
and to the attenuation and coupling parameters. The latter can be 
measured in short samples: (a few meters in length) permitting the 
immediate computation of the pulse broadening in a long fiber and, 
hence, of the obtainable data rate for a given fiber length. 

The results may also shed some light on the prospects of mode cou- 
pling introduced artificially as a means of equalization: It has been 
predicted that, under certain circumstances, increased mode coupling 
reduces the signal distortion (ultimately forcing all energy to propagate 
at an average velocity).*:”7 The objective of this paper is to outline an 
analytic approach which can answer these and other questions. 


OPTICAL MULTIMODE FIBERS 803 


II. TIME-DEPENDENT POWER FLOW EQUATION 


The differential equation obtained in Ref. 5 for the power flow in 
multimode fibers was originally derived from the mode characteristics 
by assuming certain statistics for the modal field coupling. By making 
some approximations acceptable for high-order modes, a description 
results which admits a simple ray-optics interpretation. Each high- 
order mode can be represented by a characteristic ray propagating 
inside the core along a meridional zigzag path. Internal reflection guides 
the rays at the core-cladding interface and limits the range of angles 
which can be formed with the guide axis. If n and n, are the indices of 
core and cladding, respectively, and 


Ne 
A=1-" (1) 


is a small difference, the maximum angle is given by the condition of 
critical internal reflection which is approximately 


Omax = V2A. (2) 


The rays form a uniform distribution within the cone of apex angle 
Ores 

If the core cross section permits many modes to propagate, the rays 
are so densely spaced that their distribution can be considered as con- 
tinuous. The state of the fiber at a point z and at time ¢ can then be de- 
scribed by a distribution P(@, z, 4) where @ is a continuous variable.* 
Reference 5 expresses the incremental change dP in the power P as 
a sum of two terms: 


(t) A loss —A6?Pdz; this term comprises attenuation effects in the 
cladding and the core-cladding interface and increases as the 
square of the characteristic angle @. The coefficient A is measured 
in m~'rad-?. 6-independent loss is omitted, but can easily be 
incorporated later in the final solution. 

(it) Mode coupling; in practical multimode fibers, coupling was 
found to occur essentially only between closely adjacent modes 
and, for this reason, takes the form of a diffusion process in the 
ray picture. The incremental increase in P(6, z) as a result of 
diffusion is (1/6)(0/00)(@DdP/d0)dz, a term typical for radial 
diffusion in cylindrical configurations. D is a coupling coefficient 


*6 is related to the transverse wave number u of the corresponding mode by 
u = nko where k is the vacuum wave number. 
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which, for most of the following discussion, is assumed to be 
independent of 6. 


The total variation in P thus becomes 


aP 
— —Ag? See 
dP Ao*Pde + 5 5, ( oD a; ) a (3) 
If P is a function of time t, we can also write 
oP oP 
dP = 9g Ue + a (4) 
Equating (3) and ae and dividing by dz results in the equation 
dtaP _ 5 or 


The derivative dz/dt is the velocity of the power P(@) or, equiv- 
alently, the group velocity of a mode with characteristic angle 6. By 
using the relation between 6 and the transverse wave number u, we 
can calculate this velocity from eq. (25) of Ref. 8. Except for the few 
modes close to cutoff, we obtain the simple relation 


dz Cc 


di nl + 8%) (6) 


It relates the mode velocity to the vacuum light velocity, c, reduced by 
n because of the material retardation and by a factor 1 + 6?/2 which 
accounts for the increased path length as a result of the zigzag propaga- 
tion. The derivative dt/dz required in (5) is the inverse of (6) and has 
the meaning of a delay per unit length. If we ignore the delay n/c 
common to all modes (it can be added later if necessary), we obtain 
from (5) and (6) 


Ec uanee eO(y). 
Be ee x58 +336 ( 0 Sp oe 
With the help of the Laplace transformation 
p(0,2,8) = f° e-"P(6, 2, t)dt, (8) 
0 

we can write (7) in the form 

8P _ _ 44292 2) 

dz rts 6 a6 55 (8D (9) 


where 
o = (1 + ns/2cA)?. (10) 
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Except for the factor o?, (9) agrees with (22) of Ref. 5; we can there- 
fore use the solution derived there if we replace A by Ao’. For the 
Gaussian input distribution 


Din = f(0, s) exp (— 67/03), (11) 
we obtain 
p(8, zy s) = fz, s) exp [—6?/0(z, s) ] (12) 
where ietian es . 
. _ OF<£00, + O% tanh cyoz 
O%2, 8) = a ©% + oO? tanh cyaz (18) 
and 
f(0, s)c@? 
fz, 8) = ©2, sinh cy..2 + ¢ ©? cosh cyo2 (14) 
with 
©. = (4D/A)* (15) 
and 
Yo = (4DA)}. (16) 


For ew excitation (s = 0), the angular width @(z, 0) changes mono- 
tonically from ©, to ©, as 2 increases. The width ©, characterizes 
a distribution which propagates unchanged (at steady state) and with 
the minimum overall loss coefficient y.. It seems practical to excite 
this distribution right from the beginning. The condition 0, = 0,, 
will therefore receive particular attention in the following. The solu- 
tions (12) through (16) assume that ©, and ©, are so small compared 
to Omax = V2A that practically no light propagates at angles close to 
the critical one. In other words, modes close to cutoff do not take part 
in the transmission process. Experiments have shown that the steady 
state in certain liquid-core fibers (C2C¢, in quartz, for example) is of 
that type. 

Closed-form Laplace transformations of (12) exist only for the 
approximations given in the limits z < 1/y.. and z > 1/y.~ and these 
two cases are discussed in Section III. Certain important character- 
istics of P(@, z, t), however, can be derived for all z, as we shall see in 
Section IV. 


IiI. CLOSED-FORM SOLUTIONS FOR THE IMPULSE RESPONSE 


In a practical communication system, the multimode fiber is likely 
to be fed with a pulse F'(0, t) whose width is typically of the same order 
as the broadening expected in the fiber. Its Laplace transform /f(0, s), 
appearing in (12) and (14), and its dependence on s therefore cannot 
be ignored. We assume, however, that the input is simultaneous in all 
modes being excited, in which case ©, is independent of s. Sacrificing 
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generality for clarity, we restrict this discussion to the practical input 
condition ®, = ©,; the general case can be treated in exactly the same 
way, but leads to more complicated results. 

In the case of a short fiber, we replace sinh cy,.z and tanh cyqz in 
(13) and (14) by the argument cy..z and set cosh cy,.zg = 1. With the 
help of (10), (15), and (16), (12) then becomes 





Soe = foe SY sais | -8 (at) (17) 


Yoke 
which has the Laplace transform 


exp (—6?/@%) 
1 + Yo% 


The denominator 1 + y.z expresses the loss in the short distance 
z; exp (— 62/@2) indicates that the input condition has been conserved, 
and F(0, t — n6?z/2c) shows that the portion of the input pulse F(0, #) 
which propagated at an angle 6 was delayed by né6?z/2c. Clearly, 
coupling has not affected the propagation at this distance. 

The total output is obtained from the integration® 


PO,2;0) = F(O, t — n6?z/2c). (18) 


d= Se ix (8, 2, 8)6d8. (19) 


For 2K 1/y.., we obtain with (17) 


_ rf(0, s) ©2 | 
1 OF ye2)(1 + n@2z8/20) 
If we now set f(0,s) = 1, which corresponds to an infinitesimally 


short input pulse of energy 1, the Laplace transformation of (20) yields 
the impulse response of the fiber: 


(20) 


2cr 
na(1 + yo2) 


The assumption of a mode continuum has the consequence that the 
impulse response is a continuous and well-behaved function, in spite 
of the somewhat artificial condition of an infinitesimally narrow input 
pulse. That Q extends mathematically to infinity results from the as- 
sumption of the unbounded distributions (11) and (12). Remember 
that this assumption was acceptable since 0, = 0. < max. The same 
condition limits Q(é) practically to a time interval narrower than 
NO%.x2/2c, which is the delay between the fastest and the slowest mode. 
Since (18) and (21) neglect mode coupling, they could have been ob- 


Q(z, t) = exp (—2ct/n @2z). (21) 
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Fig. 1—Impulse response according to (24) normalized for equal peak values and 
plotted versus normalized time for different fiber lengths. 


tained without the help of the power flow equation. They were derived 
here merely for a better understanding of the physical implications 
involved. 

In the case of a very long fiber, we may use the approximation 
tanh oy.z = 1 and sinhoy,z = coshayez = 4 eXPp cyYez in (13) and 
(14). Equation (12) thus assumes the form 





2 
P= paz exp [—0(09/82 + Y02)] (22) 
which leads to 
270? 
q = Tag xP (— 2702) (23) 


where p is integrated over all angles @ with the help of (19). After intro- 
ducing (10) for o into (23) we can form the Laplace transform of q(s). 
By using the condition y.z > 1, we arrive at 


_ T t 1\" weed” 4 
Q(z, t) = on/z (<r =i 5) exp (- ape r) (24) 


T=57=5— (25) 





where 
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An evaluation of (24) is shown in Fig. 1 for various normalized lengths 
Yo2. The plotted impulse responses are normalized for equal peak 
values. 

The normalizing distance 1/y., is the distance within which a 1-neper 
loss is incurred as a result of the 6-dependent loss characteristic. Note 
that additional 6-independent loss can be present. As indicated by (21), 
T is the 1/e-width, which the impulse response Q would assume at the 
distance 1/y. if no coupling were present. Closed-form solutions for 
Q are available only in the two cases discussed here, but another very 
practical characterization of the fiber output distribution can be ob- 
tained without performing the Laplace transformation. 


IV. PULSE DELAY AND PULSE WIDTH 


Because of a general relation between P(t) and its Laplace transform 
p(s), we obtain the mth moment of P(t) from 


(= a 





a af ” imPat. (26) 
0 
To achieve a suitable normalization we set m = 0 which yields 
p(s = 0) =f” Pat (27) 
0 


and divide (26) through (27). This leads to 














ae _f, @= ara G= dda 
(—1)m Sr (28) 
= ” Pat 
0 
where 
da (Pa (—OP at ae 
Os 0 (29) 
fo Pa Pat = 


The second derivative represents the variance of P(t) and, hence, 
a measure of the width of P(t): 


__ J, 4 oa (t — 6)2Pat 


” Pat 


0 


_ 0?&np 


caer ar (30) 
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The third derivative 


= _— 3 


0 3 
/ Pat O38" leno 
0 


is generally called the “‘skewness”’ of the distribution P(¢). The ratio 
n/t is a measure of the asymmetry of P and, as we shall see later, per- 
mits an immediate estimate of the value of y..z without the knowledge 
of any other fiber parameters. 

The fact that the Laplace term ns/2c appears in (9) as part of the 
sum Ac? = A + ns/2c permits us to use the relation 


a™p(8,z,s8)| (=) a™p(4, z, 0) (32) 





(31) 


os™ 2c OAm ? 





which will greatly simplify the following calculations. 
Let us now apply (30) and (81) to the general solution (12) assuming 
again the special but practical condition 0, = 0... We obtain 


n(8) = 5 | vee +(g- 5) G-et)| (33) 


for the mean delay (in addition to the overall delay nz/c) and 
T ar) 6? 2 
t p(8) = 9 | 02 + (& = i) — Wwe ee = 1) € 2%? 


apne a (F = i) ertrat] (34) 
QF 4 
for the half-width of the pulse. Figure 2 shows 6p and rp plotted as 
a function of y.z for 6 = 0 and 6 = ©. At first, a replica of the input 
pulse (7 = 0) propagates in every mode without broadening and 
merely suffers a mode-dependent delay n62/2c, as we learned already 
from (18). Very soon, however, the pulses in the individual modes 
widen; they begin to overlap even before the length 1/7. is reached. 
Once 1/y. is passed, the pulse width in all modes increases mainly as 
T(yo2)*. Compared to this increase, delay and pulse width differences 
in different modes become negligible since they cease to increase for 
large z. Specifically, 
2 

=) ; for yo2 > 1 (35) 
and 


1 
2 
5, 6\3 
(102 ae + =) for Yo2 > 1. (36) 
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dp/T OR 7,/T 





0 0.2 0.4 0.6 0.8 1.0 1.2 1.4 1.6 
Yoo 2 
Fig. 2—Delay and time spread of the fiber output on axis and at an angle@ = O,. 


To calculate delay and width of the impulse response Q(¢), we must 
first apply the integration (19) to the general solution (12). This yields 


q = rO2(o sinh oy..2 + cosh oy~2)7?. (37) 


Now, by forming the first and second derivative of q according to (27) 
and (28), we obtain 


fa =F [re td — ee (38) 


and 


2 


The ratio r/T is shown in Fig. 3 plotted versus the normalized fiber 
length yz. For z<«1/yo, the width 7 approaches 7’y..z, as expected 
for negligible coupling. At z = 1/4y., 7 begins to follow a new 
asymptote 


eee | vee(t — 2e-trat) $B — entree 4 geo (39) 


r= (T/2)(yo2). (40) 


The quality of the approximation (40) is amazingly good even for 
small z. 
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Fig. 3—Relative width of the impulse response plotted versus the normalized 
fiber length. The two straight lines show the asymptotic behavior for very short and 
very long fibers. 


The amount by which 7 deviates from T'y..z indicates the (desirable) 
effect of coupling: The width of the impulse response increases less 
with coupling than without,® the increase being proportional to 2? 
rather than z. 


V. SOME GENERAL RESULTS FOR LONG FIBERS 


The simple approximation (40) can be obtained directly from (28) if 
g in the denominator of that equation 1s set equal to unity. In this case, 


q = 70% exp (—o702) (41) 
independent of the input condition. By applying (382) to the approxi- 
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mation (41), we obtain 


0 


where Q is the inverse Laplace transform of q(s) and hence the impulse 
response as before. Equation (42) is an important and powerful rela- 
tion which permits us to calculate all moments of the impulse response 
from the steady-state loss coefficient of the time-independent power 
flow equation. 

If we let m = 2 in (42), we obtain (40) as expected. For m = 3 we 
have 


vi 
1 = 5 (3272)! (43) 
which according to its definition (81) describes the “skewness” of Q(t). 


The ratio 
n/t = (9/y02)1!6 (44) 


is a measure of the asymmetry of the impulse response. For large z, 
n/r approaches zero and, hence, Q(t) becomes a symmetric function. 
We can compute this function by introducing ¢ = tv’ + Ty..2/2 with 
t' «< TY..z/2 in (24). The impulse response 


Q(t) = OF (217/02)? exp (—Y0% — 2t!?/T?y02) (45) 


is then Gaussian in time with the variance 7 of (40). 

The asymmetry parameter (44) can be used to determine Y,.. Par- 
ticularly if merely the order of magnitude of Y,, is of interest, this can 
be obtained, with some experience, from a quick look at the asym- 
metry of the impulse response. 

Another conveniently measurable fiber characteristic is the angular 
width ©,, of the steady-state mode distribution. It can be obtained 
from a scan of the (angular) far-field distribution at the end of a long 
fiber (2 > 1/y.). If we define the effective numerical aperture NA of 
the fiber as the sine of the apex angle of this cone of radiation (measured 
at the 1/e-points of the intensity), then 


NA = nsin 0, = nQ.. (46) 


Using (15), (16), (25), and (40), we can now write the width of the im- 
pulse response as 

NA)? : 

GN @/41—)' (47) 





T= 
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This formula clearly shows the improvement, and the penalty, that 
results from coupling. Uncoupled, uniformly attenuated modes cause 
the impulse response to broaden to an effective width of 2(NA)?/2nc 
in z km of fiber. This width can be reduced by a factor (4y.2)? in ex- 
change for an increase in the overall attenuation by 4.357. dB/km. 

The physical contents of these results can best be summarized if we 
define a ‘‘coupling length” L = 1/4y.. As shown in Fig. 3, this length 
marks the point at which the width of the impulse response changes 
from a linear to a square-root dependence on length. Together with this 
change, the impulse response undergoes a transition from the exponen- 
tial shape (21) to the Gaussian shape (45). The inverse of the coupling 
length (in km) is very nearly equal to the excess loss (in dB/km) in- 
curred because of the coupling phenomenon. 

Equation (47) as well as the previous results are limited to fibers in 
which the coupling coefficient D is independent of 6. The study of 
liquid-core fibers, on the other hand, has given us reason to believe 
that, in some fibers, D decreases with increasing 6. This characteristic 
could have a desirable effect on the impulse response and the excess 
loss, since it reduces the power flow toward the lossy modes (large 
angles) while, at the same time, enhancing the coupling among all 
other modes. We therefore studied the general case 


D(@) = D6”, y =0,1,2,..., (48) 
in some detail. 

The steady-state parameters 0,, and y., can be obtained as general 
functions of A, D,, and v by using the Rayleigh-Ritz procedure.® Due 
to (42), twofold derivation of y.. with respect to A then yields directly 
the effective width of the impulse response. This calculation leads to 

2 
= SS" ea t re? (49) 


2nc 





where NA = n@,, denotes the effective width of the output radiation 
as before. For v = 0, (49) reduces to (47). An exponent v > 0 indeed 
narrows the impulse response, although not very significantly. For 
D = D,/®‘, for example (a strong angular dependence indeed), 7 is 
only 0.7 times narrower than in the case D = const. 

Another limitation of these results is the requirement that 
©, « (2A)#. If this is not the case, the steady-state distribution is 
generally determined by a sharply rising loss term at @max = (2A)! 
rather than by the quadratic term A6?. Under these conditions we find 
that the functional relation (47) still holds, although with different 


T IN ns 
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coefficients, so that for practical distributions D(@) the width 7 can be 
up to three times smaller than indicated by (47) or (36). 

As a typical example, we shall use (36) to compute the data rates 
achievable. Let us assume that we had means to design and manu- 
facture a coupling structure in the fiber which produced the desired 
excess loss Y,. and the desired numerical aperture. For simplicity we 
assume the input pulse to be somewhat narrower than the fiber impulse 
response so that (36) gives a good measure of the half-width of the ex- 
pected output pulse. We then choose a data rate 


B = 1/2r. (50) 


We assume the core loss common to all modes to be 4 dB/km and al- 
low 50 dB of loss between repeaters. 

Using (386) we can then calculate the excess loss and the repeater 
spacing necessary for a desired data rate. These results are plotted in 
Fig. 4. A 1-dB/km excess loss decreases the possible repeater spacing 
by only 25 percent but, at the same time, triples the data rate. An at- 
tempt to further increase the data rate by even more coupling is costly: 


REPEATER SPACING IN km 
12.5 12 11 10 9 8 7 6 5 4 


DATA RATE IN Mb/s 


CORE LOSS 4 dB/km 
50 dB BETWEEN REPEATERS 





0.01 0.02 0.04 0.1 0.2 0.4 1 2 4 10 
EXCESS LOSS IN dB/km 
Fig. 4—Width of the impulse response plotted versus the excess loss incurred be- 


cause of coupling for 4 dB/km core attenuation and 50 dB loss between repeaters. 
Right side shows equivalent data rates. Repeater spacing is shown at the top. 
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Another threefold increase in the data rate requires 4 dB/km excess 
loss, and divides the repeater spacing in half. The results of Fig. 4, of 
course, are based on a uniform coupling distribution and an excess loss 
increasing as 62. As mentioned earlier, more favorable distributions 
could result in an impulse response permitting a higher data rate, al- 
though there may be practical limitations to the extent of this 
improvement. 


VI. CONCLUSIONS 


The description of fiber modes by a continuum results in a partial 
differential equation whose solution yields the response function of the 
fiber. We find a characteristic length indicating the region in which the 
impulse response changes from an exponential to a Gaussian shape. 
Beyond this length, the width of the impulse response increases only as 
the square root of the fiber length. In practical fibers, the inverse of 
this length turns out to be proportional to the excess loss incurred be- 
cause of the coupling phenomenon. The latter may represent a prac- 
tical limit to the improvement that can be gained from coupling. As 
an example, we find a data rate of 12 Mb/s achievable for 10 km re- 
peater spacing and an effective numerical aperture of 0.1. The data 
rate is inversely proportional to the square of the numerical aperture. 
Thus half the numerical aperture permits a fourfold increase of the 
data rate. Another increase of the data rate without a penalty in loss 
or numerical aperture is theoretically possible by artificially creating 
a more suitable coupling characteristic in the fiber, but it seems that 
the technological requirements for doubling or tripling the data rate 
in this way are high. 
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Coupled Mode Theory of Round 
Optical Fibers 
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(Manuscript received January 12, 1973) 


This paper presents a comprehensive theory of mode coupling in 
optical fibers with imperfections. The paper begins with the derivation of 
a general coupled wave theory based on the modes of the ideal fiber. The 
general theory is applied to a simplified description of guided and radia- 
tion modes of the fiber that is valid for small core-cladding index differ- 
ences. The simplified theory results in expressions for the coupling coeffi- 
cients that are nearly as simple as those of the slab waveguide. As an 
example, the theory ts applied to the calculation of radiation losses caused 
by pure core diameter changes and by elliptical deformations of the fiber 
core. 


I. INTRODUCTION 


Dielectric optical waveguides support a finite number of guided 
modes and an infinite number of radiation modes.! Even if the number 
of guided modes that can be supported by the waveguide is reduced to 
one, the presence of the infinite number of radiation modes forces us 
to be concerned about mode conversion phenomena. Coupling among 
the guided modes of a multimode optical waveguide (multimode wave- 
guide refers to the guided modes) is caused by imperfections in the 
refractive index distribution or the geometry of the optical waveguide. 
Its effects may be beneficial for reducing the delay distortion that re- 
sults from uncoupled multimode operation.?:? Coupling between guided 
modes and the continuum of radiation modes is usually not desired 
unless the waveguide is intended to serve as an antenna. However, a 
certain amount of coupling is unavoidable and results in scattering 
losses. 14:5 

E. G. Rawson has calculated light scattering from fiber waveguides 
with irregular core surfaces by an approximate technique.* However, 
his method is not suitable to calculate coupling between guide modes. 

817 
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A theory of radiation losses in round optical fibers has been presented 
in Ref. 7. This theory was based on a field expansion in terms of the 
exact modes of the fiber. Because of the complicated mode fields, a 
theory based on the exact modes of the guide is very tedious and re- 
sults in equations whose numerical evaluation is difficult and costly. 
However, Snyder® and Gloge® have shown that the description of the 
modes of dielectric waveguides can be greatly simplified if it is assumed 
that the difference of the refractive indices of core and cladding is only 
very slight. This assumption allows an approximate treatment of the 
mode problem, resulting in very much simpler field expressions. The 
coupled mode theory based on approximate modes yields expressions 
for the coupling coefficients that are almost as simple as those for the 
slab waveguide. The entire coupled mode theory thus becomes simpler 
and its numerical evaluation becomes cheaper. 

This paper starts out with a derivation of the coupled wave equations 
in terms of modes of the ideal guide. This mode description is somewhat 
different from the coupled mode theory in terms of local normal modes 
used by Snyder.!°!! It results in simpler expressions for the coupling 
coefficients. The exact coupled mode theory is then applied to the 
problem of coupling between the simplified waveguide modes. We limit 
the discussion to coupling caused by changes or imperfections in the 
waveguide geometry. Coupling caused by refractive index inhomo- 
geneities, which could be handled in a similar fashion with the use of 
the exact expressions for the coupling coefficients and the approximate 
mode description, is not discussed in this paper. 

Finally, we apply our results to the problem of scattering losses of 
guided modes caused by diameter changes and elliptical deformations 
of the waveguide core. We also derive simplified expressions for the 
coupling coefficients between guided modes far from cutoff and discuss 
coupling between guided modes caused by deformations of the fiber 
core and by curvature of the waveguide axis. 


II. EXACT COUPLED MODE THEORY 


The dielectric optical waveguide with imperfections is defined by 

a certain refractive index distribution n = n(z,y,z) that enters 
Maxwell’s equations: 

V XH = wwe .n*?E (1) 


VXE = —twy.H. (2) 


E and H are the electric and magnetic field vectors of a general field 
distribution in the waveguide. The fields are assumed to have the time 
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dependence e**', with radian frequency w; e, and yw. are the vacuum 
values of the electric permittivity and magnetic permeability. In addi- 
tion to the refractive index distribution n of the real waveguide, we 
consider the index distribution n, that defines an ideal guide from which 
the real guide deviates in some way. 

We decompose the fields into their transverse and longitudinal parts. 
The electric field is thus represented by the equation 


E ioe E, + E, (3) 
and the magnetic field is given by 


By using a similar decomposition of the V operator (e, is a unit vector 
in the z-direction), 


) 
V= Vit 2, (5) 


Maxwell’s equations can be written in the form 


V.xX H.+ (c. x oe) = iweonE, (6) 
and . 
v. XE, + (e. x =) es: (7) 


The longitudinal field components are expressed in terms of the trans- 
verse field components 


1 
E, = iwe,n? Vv. xX Hi (8) 
and 
1 
H, = — — V. X E.. (9) 
IW o 


The modes of the ideal waveguide with index distribution n,(z, y) are 
defined as solutions of the equations 


Vv: x Hy. — 18,(ez x 5,1) = 1w€ oN Eyt (10) 
and | 
Vi x yz as 18,(ez x &y1) = —1W po Hy. (11) 


The index v is a mode label and 8, is the propagation constant of the 
vth mode. The longitudinal components of the mode fields are similarly 
expressed as 


1 
&2 = iene Vix KH (12) 
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and 


,, = — Be VX Sx (13) 
IW o 





The transverse field of the waveguide with the index distribution 
n(x, y, 2) IS now expressed as a superposition of modes of the ideal 


waveguide. 
E, = a Ay &yt (14) 


and 
KH, ae Si by Hy. (15) 


The summation symbol in (14) and (15) indicates summation over 
guided modes and integration over radiation modes. Indicating the 
index vy by the symbol p in case that it belongs to the continuum of 
radiation modes, we have to replace 


2 dp. (16) 


The sum in front of the integral on the right-hand side of (16) indicates 
a summation over the various types of radiation modes. 

For the derivation of coupled differential equations for the expansion 
coefficients a, and b, we need the orthogonality relations of the modes 
of the ideal waveguide. 


i / e.-(8y X Ke )drdy = 2 tA Pvp. (17) 
—0 J —00 B 
The asterisks indicate complex conjugation. The symbol 6,, indicates 
Kronecker’s delta for discrete values of v and yu; it is zero if one of the 
indices labels a guided mode while the other labels a radiation mode, 
and it becomes Dirac’s delta function if both indices label radiation 
modes. 

The series expansions (14) and (15) are now substituted into the 
equations (6) through (9). Making use of the fact that the mode fields 
satisfy the equations (10) through (13), we obtain 


» iz + iB,» ) (ez xX KH,,) = dwe(n? _ niet, = 0 (18) 





and 





da , . 1 
Da {( + ip, (e, x 1) + 1W€o bv: 


: dz 
x l(a — =) (Vi X x.) || = 0. (19) 
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We take the scalar product of (18) with & and of (19) with 3¢;,. After 
integration over the infinite cross section, we obtain with the help of 
the orthogonality relation (17) 


oe + 18,4, =2> K,,a, (20) 
and 
din ; 7 
ES + 76,b, = 2>5 kd, (21) 
with the coupling coefficients 
a dda 2 »2\e*, 
or UPB. bax (n n>) Eur? & dxdy (22) 
and 
k ssh [Bul ” get, Vv; 








pe 
~ AwesP Bee hsxab dias 
n2 


x (4 — =| Vv.xX a. dxdy. (23) 


Equation (23) can be brought into a simpler form with the help of (12) 
and by performing a partial ieicae 


i. = ote Bal [2 Foc ig Qh g® 
Kw = uP pt J. 6 (ny N3) Suz Sy 2dxdy. (24) 


Finally, we introduce the amplitudes c{+? and ¢{~ of forward and back- 
ward traveling waves by means of the transformation 


Qn = el De-tBuz + cf e tBu (25) 








and 

by = cf Pe-*Bue — ol ether, (26) 
Substitution into (20) and (21), addition and subtraction of the result- 
ing equations, and regrouping of terms results in the desired coupled 
wave equations 








(+) 
sat = [KEP Veet Bu-Be + KO Nolet (Bat Br)2} (27) 
de,” (— Heal+) e-i (Bat (—,—)a(—) 9-7 (By By) 

dz ame {Ky ad Mae ae Ky C, enw an (28) 


The coupling coefficients are defined as 


Kee? = af [-@ ~ ni) {pH [Bul Sut' St 


[Bul no 
+q By on fas Ee} dxdy. (29) 
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The factors and superscripts p and q indicate the symbols (+) or (—) 
or the corresponding factors +1 and —1. The propagation constants 
8, are positive quantities. The coupled wave equations (27) and (28) 
provide an exact description of the imperfect waveguide in terms of 
normal modes of the perfect guide. The use of normal modes of the 
perfect guide results in the simple general form (29) for the coupling 
coefficients. 

Our coupled mode theory can be applied to any type of waveguide 
problem such as waveguides with refractive index inhomogeneities, 
tapers, or bends. It may be that the expansion in terms of ideal modes 
of the waveguide does not provide the most convenient basis for some 
problems. For the description of tapers, for example, we face the follow- 
ing situation. Consider a waveguide which is perfectly straight and uni- 
form up to a point where its cross section begins to increase. After 
some distance, the taper connects to a uniform waveguide of constant 
cross section. If we use the modes of the smaller guide for our mode 
expansion, we see immediately that the coupling coefficients have non- 
zero values not only on the taper itself but also throughout the entire 
waveguide of larger cross section. Even though our description is 
precise and yields the right answers, it is inconvenient for the problem 
at hand. It would be far better to use so-called local normal modes that 
do not themselves describe wave forms in any real waveguide but corre- 
spond at each point z along the nonuniform guide to the modes of 
a hypothetical uniform guide whose cross section coincides locally with 
that of the waveguide under study. Using local normal modes results 
in coupled wave equations of the form (27) and (28) but with different 
coupling coefficients. In case of the taper, these coupling coefficients 
would be nonzero only on the taper itself but would vanish on the uni- 
form waveguide sections. Local normal modes are obviously better 
suited for the description of tapers. For the description of waves in 
bent waveguides it would be most convenient to use modes that locally 
correspond to a straight waveguide whose axis is tangential to that of 
the actual guide. In addition to these problems of convenience, there 
exist problems of convergence of the series expansions (14) and (15). 
A series expansion in terms of ideal modes may converge more slowly 
than an expansion in terms of local normal modes. However, we shall 
see that we can use the series expansion in terms of ideal modes to treat 
most problems of waveguides with only slight refractive index differ- 
ences between the core and cladding materials. In addition, it is usually 
possible to guess the form of the coupling coefficients of a particular 
expansion from the coupling coefficients for the ideal mode expansion. 
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For the purposes of this paper, we restrict ourselves to the descrip- 
tion of waveguides with piecewise constant refractive index distribu- 
tions and allow only deformations of the cross section of the waveguide 
core. Figure 1 shows a typical waveguide imperfection. The refractive 
index distributions n and n, coincide inside of the core and in the clad- 
ding region. They differ only near the core-cladding boundary. If the 
boundary has moved outward from its ideal position, the index differ- 
ence n? — n2 equals nj — n3 in the region where the actual core over- 
laps the ideal cladding; it vanishes everywhere else. If the core bound- 
ary has moved inwards, we have n? — n2 = —(nj — n§) in the region 
where the actual cladding overlaps the ideal core and zero values 
everywhere else. The field components that multiply the refractive 
index difference term in (29) are either continuous, if they are tangen- 
tial to the boundary of the ideal waveguide, or they jump by a factor 
(n1/n2)? if they are normal to the ideal core boundary. If we restrict 
the discussion to core boundary displacements that are so slight that 
the fields can be considered constant over the region of the displace- 
ment and to weakly guiding fibers with (ni/n2 — 1) < 1, we obtain 
from (29) 


ee — n§)a 


Kg® = [(b@na-4) 


X [p Su & + (iz: &2]}dd. (30) 


It is noteworthy that the approximate coupling coefficient (30) is 
identical to the coupling coefficient (13) of Ref. 12 for the local normal 
mode expansion. The only difference in the appearance of these two 
coupling coefficients consists in the fact that the derivative of the 
boundary function instead of the function itself appears in Ref. 12 and 
that the entire expression is divided by 6, — 6,. It has been explained 


J — ACTUAL CORE—CLADDING BOUNDARY (x,y,z) 





vA 
IDEAL CORE—CLADDING BOUNDARY ~ 


Fig. 1—Sketch of a fiber with distorted core-cladding interface. 
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in a different place!’ that it is the Fourier component of the boundary 

function at the spatial frequency 6, — 8, that determines the coupling 

behavior of the modes. This fact makes it clear that (we write f instead 
of r — a for simplicity) 

i___— af 

1(B, — Bu) dz 


is fully equivalent to the function f itself as far as its effect on mode 
coupling is concerned. We can carry the argument one step further and 
replace f with 


(31) 


payee eee 
(By —_ Bu)? dz? 


If we replace f = r — a in (380) with (382), we obtain a coupling coeffi- 
cient that vanishes in straight, uniform waveguide sections. It is not 
hard to guess that a coupling coefficient of this type belongs to an ex- 
pansion in terms of local normal modes of a hypothetical waveguide 
that is tangential to the curved axis of the actual guide. The modifica- 
tion of (30) that is indicated by (32) is thus particularly suitable for the 
description of mode coupling caused by bends of the waveguide axis. 
A description in terms of ideal modes or even in terms of local normal 
modes of a hypothetical guide with straight axis is unsuitable for a de- 
scription of waveguide bends since it leads to coupling coefficients that 
do not vanish on the straight waveguide section behind the bend. This 
brief discussion shows that it is not hard to modify the coupling coeffi- 
cients of the ideal mode coupling theory to extend it to the case of local 
normal mode expansions of different types. 


(32) 


Ill. SIMPLIFIED DESCRIPTION OF GUIDED MODES OF THE FIBER 


A. W. Snyder® realized that the modes of round fibers and their 
eigenvalue equations simplify considerably if use is made of the fact 
that (m1/n2 — 1) «1 applies to most fibers of practical interest. D. 
Gloge® went one step further and showed that the mode fields become 
simple in appearance if they are expressed in Cartesian instead of 
the more conventional description in cylindrical coordinates. Gloge’s 
technique is useful for even more complicated waveguide structures 
such as tubes.!4 

We write down the field expressions for the guided modes of the 
round fiber without derivation. The mode fields can be polarized in 
two mutually orthogonal directions. We have for one polarization in 
the core region for r < a 
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6, = eek Raxco { ne an in 





28, —cos (v + 1)¢ f 
sin (v — 1)b 
TANS) | —cos ” — et (33) 
Ey = AJ, («r) ee 2 (34) 
_ _ tAk |e, cos (v + 1)¢ 
se ea) a | r4a(«r) i (v + Kae 
= soso {R= BOY] 
ere By Eo 2 Cos vd ; 
vase lea | By | Ko Jak r) i St (36) 


The field in the (infinite) cladding region for r > a is obtained from the 
field expressions (33) through (86) by replacing the amplitude constant 
A with [J,(xa)/H (¢ya) JA. In addition, we replace x with 7y and the 
Bessel function J,(«r) with the Hankel function of the first kind 
H{ (tyr). The parameters «x and vy are defined as (k? = we ou) 

x = (nik? — B7)} (87) 
and 

y = (BF — n3k?)}. (38) 
The remaining field components vanish, 6, = 0 and 3X, = 0. The two 
sets of circular functions that are shown in the field equations are neces- 
sary to obtain a complete set of orthogonal modes. The functions in the 
upper as well as those in the lower position belong together. We have 
used 7 to indicate n ~ ny = ne. 

The set of guided modes is still not complete unless we also include 

the orthogonal polarization. We have again for r < a(6, = #, = 0) 


g, = tse | Jesse) ee of Hi ae) {en = Del | ™ 





28, sin (» + 1)¢ sin (v — 1)¢ 
8. = Adler) {5°78 (40) 
a= SEE [suste {286 T oat 
soso { S86 2H) en 
ae, = nA a Je J,(cr) (a a. (42) 
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The field in the cladding is again obtained by the replacements men- 
tioned above. 

The mode amplitudes must be related to the power P (that is the 
same for all the modes). We have for the fields (33) through (36) and 
for the orthogonally polarized field (39) through (42) 


44 jae yP 
Eo 


ee CLC a Re COLA Coll ae 
with 
2 for vy=0O 
ee i for y #0 - 


The eigenvalue equation of these simplified fiber modes is 


Jra(xa) _ HE (bya), 
“Tka)  ” H&Gya) 


With the help of the functional relations of the cylinder functions, it 
is easy to show that (45) is also valid if » — 1 is replaced by » + 1. 

The simplified guided modes listed here are not the same as the usual 
HE and EH modes of the round fiber. It can be shown®:’ that the simpli- 
fied modes listed in this paper result from the usual fiber modes as super- 
positions of an HE and an EH mode. The HE,41,, and EH,-1,, modes 
have very nearly the same propagation constant, they are almost de- 
generate. Since this degeneracy is not perfect, our simplified modes are 
not modes in the true sense of the word. They decompose into the 
HE,.1,, and EH,_1,, modes of the round fiber as they travel along the 
waveguide thus changing their shape. A true mode is defined by the 
fact that only its phase changes (in the lossless case) as it travels down 
the guide. However, the approximate modes do form a complete 
orthogonal set of modes and can thus be used to express any field that 
can exist in the fiber. Even after one of the approximate modes has 
decomposed into HE and EH modes, it can again be expressed in terms 
of approximate modes at this point. The fact that the approximate 
modes are not true modes in the usual sense does not limit their useful- 
ness for studying mode conversion and radiation problems. 

The important fundamental HEy; mode of the fiber corresponds to 
the lowest-order approximate mode with » = 0. This is a true mode 
that does not decompose as it travels along the waveguide. 





(45) 


IV. SIMPLIFIED RADIATION MODES OF THE FIBER 


The radiation modes of the fiber can again be simplified by using 
(ni/n2 — 1) < 1.1* There is a slight complication, however. The simpli- 
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fied description of the guided modes was made possible by the fact that 
they are very nearly transverse modes, their transverse components 
being much larger than their longitudinal components. The radiation 
modes are nearly transverse only if their propagation constants are 
nearly 6 = nek. Since the continuous spectrum of radiation modes ex- 
tends from 6 = —n2k to 8. = nek, only the modes in the immediate 
vicinity of the two end points of this interval are also nearly transverse 
modes. Throughout most of the spectral region of 8 values, the ap- 
proximation corresponding to that for the guided modes does not 
work. However, we can still use the fact that the refractive indices of 
core and cladding are nearly identical and use the radiation modes of 
free space in the region where our mode approximation technique fails. 
A simplified treatment of all the radiation modes is thus also possible. 
The two approximations complement each other. In the region near 
8 = -Enek, where we use the approximate radiation modes of the guide, 
the free-space radiation modes do not work very well because reflec- 
tions at the core-cladding interface at grazing angles are important. 
Inside of the 8 range, where the waveguide mode approximation 
method fails, we can use the free-space radiation modes with confidence 
since the interface does not cause much reflection for waves passing 
through it at reasonably steep angles. 

We begin by listing the approximate radiation modes of the fiber for 
6 near --nok. The field equations are very similar to those of the guided 
modes. In the fiber core at r < a we have 


6, = abe | sesalor) { aac it 


| 


28 —cos (v + 1)¢ 
sin (v — 1)¢ 
+ Jy-a(or) i ae A (46) 
b= BIG i ; ‘i (47) 
_ _ tBa feo cos (v + 1)¢ 
eR, = — Ok ue | Sessler) es (v ae He: 
cos (v — 1)¢ 
on ee i a Aa (48) 
- B Eo COS vb 
We = —nB 1 a/J,(or) ee ot (49) 


The remaining field components vanish. The propagation constant 6 is 
a continuous variable for radiation modes unrestricted by an eigenvalue 
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equation. The parameter o is defined as 
o = (nik? — B?)} (50) 


Instead of specifying the modifications that are required to transform 
the expression of the field inside of the core into the expression for the 
cladding field, we state the field in the region r > a in detail. 


8, = FP | caeater) + DEelon) | _ON TS 
+ (H®2,(or) + DH®(or)) | eee (51) 
by = CCHS) + DUP (or) | 2% (52) 
se, = — P|] (H1s(0r) + DEP) era 
— (HEA(or) + DHBalon) { O8 — DE |] (oa 
See = —nC Er a) $2 (Mor) + DEP (on) | O° mt: (54) 


A® and H® are the Hankel functions of the first and second kind. The 
parameter p is defined as 
p = (nzk? — 6?)}, (55) 


The amplitude coefficients are 


C= 4 ToJ,41(00)H (0a) — pJ,(oa)H@a(pa)B (56) 


and 

oJ 41(00)H (pa) — pJ,(oa)HP2i(pa) | 
oJ y41(0a)Hy(pa) — pJ (oa) H¥21(pa) 
For the field with the orthogonal polarization, we simply state the field 
expressions inside of the core. It should be apparent from inspection of 
(46) through (57) how the field expression in the cladding is obtained 
from that of the core. The relations between the amplitude coefficient 
are the same in either case. We have for r < a 


eed | Josa(or) ee @+ a 


26 sin (v + 1)¢ 
cos (v — 1)¢ 
_— J,-1(or) ee (v _ wad (58) 


D=- (57) 
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8, = BJ,(or) i sal (59) 


sin vd 


_ Bs |e sin (v + 1)¢ 
= Be | Talon —cos (v + al 


sin (v — 1)¢ 
Tedeie) —cos (v — 1)¢ }} (60) 


“ J,(ar) a ine (61) 


se sin vp 


BT 
It remains to relate the amplitude coefficient to the power P. Because 
of the continuous mode spectrum, it is not possible to normalize the 
radiation modes with a finite amount of power. The parameter P is 
defined by the relation 


se Se ee * 
sf f (8, X 3%) -e,dedy = a Piea): (62) 
The amplitude coefficient B is thus [e, is defined by (36) ] 
(# Jp P)} 
Ris (63) 


enar| oJy1(0a)H (pa) — pJ(oa)H®;(pa)| 


It is important to remember that the radiation modes listed so far are 
valid only in the immediate vicinity of 6 = -+-n2k. Inside the @ range, 
we use the radiation modes of homogeneous space with refractive 
index n2. These modes can be expressed in a number of different ways. 
The simplest expressions would result from a plane-wave representa- 
tion. However, for our present purposes, it seems advisable to use field 
expressions that resemble most closely the radiation modes (46) 
through (54) and (58) through (61) in order to achieve continuity of the 
field expressions throughout the entire 8 range. The modes of the 
homogeneous medium (in vacuum we would say free-space modes) are 
simpler than the radiation modes of the fiber, since one expression ap- 
plies throughout all of space. There is no need to treat the fields inside 
and outside of the core separately. The field expressions that satisfy 
our requirements are [/p is defined by (55), we use n = nz | 


_ iCp sin (» + 1)¢ 
oo = 8 pL Yenlon ee (+ Def 


+ Jrator) | one Het] oo 
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CJ,(pr) a a (66) 


sin v@ 


_ tp feo cos (v + 1)¢ 
ee, = — ee | Joss(or) ta (v ats a 


— Frater) Be ET Het] cn 


w= — Sf [2 (8+ 8) sion [core] 
)(emnton {a EBs} 

Ba mr () i : = a \)| on 
sey = Ff (F—ME| Flor) {_ om Te 


~sndon{ S881). 


Go 
e 
I 


+ 
~~ OS 
>| 
S1D 


The os polarized modes are 
cos (y+ 1l)p{ _ cos (v — 1) 
6-55 au riler) a a a Jester) ee a ad © 


8. = CS.,(pr) “4 (71) 


sin vd 
6, = 0 (72) 


tC'p sin (v + 1)¢ 
KK, = oN J y41(p7) { —cos (v ae 1)¢ } 
+ Frater) { SEO St] a 


cos (v — 1)¢ 
_ CC fefB nk sin (vy + 2)¢ 
seam — Gl (E— Fe) [toste {oe ef 
sin (v — 2)¢ 
= Fns(ory | Sh? st] 


n cos vd 

a ) Jolo ne “ 
nk cos (v + 2)¢ 
ee (J r+2(or) i Ga A 


+ Frater) {See POLY). 76 
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The amplitude coefficient C is related to P 

4 = kpBP 
e,1 (8? + n?k?) 
The modes of the homogeneous medium, like all the other modes, are 
mutually orthogonal among each other. With the help of the relations 


C= (76) 


Jo) HBAa) — SnsleHO (a) = (77a) 
and 
TA@)H BA) — Jnss(@) HP (@) = — (77b) 


it is easy to show that the radiation modes of the homogeneous medium 
and the radiation modes of the round fiber reduce to the same expres- 
sions in the limit |6| = nok, n1 = ne = n. 


V. COUPLING COEFFICIENTS FOR CORE-CLADDING IMPERFECTIONS 


We consider a fiber whose core-cladding interface is described by the 
function 


r(x, Yy, 2) = a + f(z) cos (md + ¥). (78) 


If we choose a different function f(z) and different phase y for each 
integer m and sum over the second term on the right side, we generate 
a Fourier series which allows us to describe core-cladding imperfections 
of the most general kind. 

We assume that a given mode labeled » is traveling in the waveguide 
and ask for the coupling from this mode to all other guided and radia- 
tion modes. The function f(z) can be separated out from the coupling 
coefficient by defining : 

Kw = Ky fe). (79) 


We have mentioned earlier that the longitudinal field components 
of the guided modes are much smaller than their transverse compo- 
nents. The same statement is true for the radiation modes only if their 
propagation constant @ is very nearly equal to --n2k. For |8| values 
much smaller than n2k, the longitudinal field components of the radia- 
tion modes can be as large as or larger than the transverse components. 
The coupling coefficients contain scalar products of the two fields that 
are coupled together. Coupling coefficients that involve at least one 
guided mode are thus determined primarily by the transverse compo- 
nent of both fields, since the product of the longitudinal components is 
small over most of the range of 6 values. We thus neglect the contribu- 
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tion of the longitudinal components and gain the advantage of much 
simpler expressions for the coupling coefficients. The only region of the 
8 range where the longitudinal components could make a significant 
contribution to the coupling process is near 8 = 0. Outside of a small 
region near 6 = 0, our approximation is reasonably accurate. To this 
approximation, modes with orthogonal (transverse) polarization are 
not coupled by core-cladding imperfections of the form (78). 

For coupling between two guided modes v and p» we obtain from (30) 
and (79) with the help of the field expressions 


ir (vq) —_ uvm Dv wl v( Kya) T w(K wO) ; 
Keo = : : (80) 
Neuey 2iankL | Jr—1(K a) I v4.1(Ka)S pa(k@)S pga(x yar) | J? 


The factor €yym can assume the values 4, 2, 1, or 0. It is zero unless 
uw =v +m. Table I shows the values of e€ym for all possible cases. The 
factor e, is defined by (44), x, and y, are determined by (37) and (88). 

Coupling between the guided mode » and radiation modes » must be 
described by two different coupling coefficients depending on the value 
of the propagation constant 6 of the radiation mode. For values of || 
close to n2k, we use the radiation modes of the fiber and obtain 


a €uvm 
K ve as 
Veen 





3 
p( 2 = 1) yvlodoleoa)J (ea) 
Fr Jy alta) Trya(t0) |?) oJ, a(ca) Hy, (pa) — pJy(ca) Hi? 1(pa) | 
(81) 


For 6 values inside the range —nok < 6 < nk excluding the end points, 
we use the radiation modes of homogeneous space and find 


- Pp E (= = 1) kB | aedo(00a)¥ (00) 
nvm ak Ne . 
Vesey, *[2(62 + n?k?) | Jy-1(%0)Jy4.1(x,a) | JF 


Use of (77) allows us again to see that (81) and (82) become identical 
in the limit m1 = nz = n, |B| = nok. 

The coupling coefficients for the approximate guided and radiation 
modes of the round optical fiber allow us to solve a large number of 
problems involving fibers with core-cladding interface irregularities. 





Rye = (82) 


VI. FAR-FROM-CUTOFF APPROXIMATIONS 


For purposes of multimode operation it is often desirable to have 
simple expressions for the coupling coefficients which are valid far from 
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TABLE I—TABULATION OF THE FACTOR €yym FOR ALL POSSIBLE 
CoMBINATIONS OF ANGULAR FIELD DEPENDENCE OF THE MopEs 


AND THE CORE-CLADDING INTERFACE DISTORTION 


€uwm = O unless specified otherwise 


incident mode spurious mode distortion 
COS vp COS ud cos mp 
4 p=p=m= 
9 7 = 0, w=m 
Cone = w=0, v=m 
1 O<np=rem 
0O<n=m->p 
incident mode spurious mode distortion 
sin vp sin ud cos mp 
0 vornp=0 
Cum =\+tl O<p=vem 
—-1 O<p=m-—pv 
incident mode spurious mode distortion 
COS vp sin ud sin mp 
0 n= 
2v=0, p=m 
Con = 1 p=v+m 
—-1 O0O<yp=rv-m 
1 0<p=m-y 
incident mode spurious mode distortion 
COS vp sin ud COS Mp 
Cum = 0 
incident mode spurious mode distortion 
COs vp cos ud sin md 
Cum = 0 
incident mode spurious mode distortion 
sin vp sin ud sin md 
Cum = 


cutoff. We obtain such approximations by using the approximation for 
large argument of the Hankel function (ya > 1) 


[ 2 ; 
eovan a —i[r/4t+y(r/2)]lp—-ya 
Hiya) = ia e774, (83) 
With (83), we obtain from (45) for ya > 1 
yJ (xa) = —xKJ,-1(ka). (84) 


We remarked earlier that (45) is also valid if » — 1 is replaced with 
vy + 1. We thus also have 


yJ (Ka) = KJ y41(Ka). (85) 
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Multiplying (84) with (85) and taking the square root results in 


yd (Ka) = Kl —Jy-1(ka) J y4.1(ka) |? 
KL | Jy-1(xa) J p41(ka) | J. (86) 


Far from cutoff, (86) allows us to write the coupling coefficient (80) 
between guided modes in the simple form 


Kgo = Men“ 87 
. Ve,e, 2iank ey 


Very far from cutoff, ya — », we obtain the approximate eigenvalue 
equation 
J,(x,a) = 0 (88) 


from (84) or (85). The roots of the Bessel function J,(x,a) thus deter- 
mine the values of «,a that appear in (87). For higher-order modes x,a 
becomes large so that we can approximate 


TGs NEE cos E . (> - 5) 5: (89) 


Equation (88) requires that the argument of the cosine function equal 
(2N + 1)7/2. This leads to a direct determination of 


Ky © (> + 2N + 5) : (90) 


with N = 0, 1, 2---. The equations (87) and (90) provide us with an 
approximate determination of the coupling coefficient between two 
guided modes without the need for solving a transcendental eigen- 
value equation. In a strict sense, we would have to label x and the 
coupling coefficient with N as well as v. We refrain from burdening the 
symbols with too many indices. 

The coupling coefficients between guided and radiation modes can 
similarly be simplified far from cutoff of the guided modes. The far- 
from-cutoff approximation of (81) is 


} 
Pp (@ = 1) VpkrJ y(oa) 
Rge = “ee = 


This equation is valid only for |8| values very close to n2k. The coeffi- 
cient that describes coupling between guided modes and the radiation 
modes of homogeneous space, eq. (82), leads to the far-from-cutoff 
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approximation 


2 P E (= S 1) a8 | ool s(n 
Keo = =8@ oN fd (92) 
Veen 1L2(8? + n?k?) 


This expression is valid inside the range —nk < 8 < nk but not near 
|8| = nek. For most scattering problems, it is sufficient to use the 
coupling coefficients to the radiation modes of the homogeneous 
medium. Only if the scattering is sharply forward or backward directed 
do we have to use the slightly more complicated expression (91) of the 
coupling coefficients to the true radiation modes of the round fiber. 
In the remainder of the paper, we apply our results to special cases. 


VII. COUPLING CAUSED BY WAVEGUIDE BENDS 


We consider the case of a straight fiber that is connected to a fiber 
section that is bent with a constant radius of curvature and finally con- 
tinues in a straight section. If the curved piece of waveguide causes 
considerable mode conversion, the system of coupled equations (27) 
and (28) must be solved. However, for slight mode conversion, we can 
use the approximation that the incident mode does not change very 
much while power builds up in some of the spurious modes. In this 
case, we obtain the following approximate solution from (27): 


L 
HD) Seo) i K yp(z)e% 80~Be)edz, (93) 


We assume z = 0 at the beginning of the curved section of length L. 
The description (32) is most appropriate in this case. The second de- 
rivative assumes the constant value 1/R, with R being the radius of 
curvature of the circular bend. We thus obtain from (79) [with f 
replaced by (32) ] and (93) 


o(L)|? _ 1 4|Ryl? 
c,(0) FR? (3 > By)® 


We obtain K,, from (80) or in its ‘“far-from-cutoff”’ approximation 
from (87). The integer m appearing in (78) must be set m = 1 in this 
case, since we want to describe a continuous offset of the fiber which 
results in a bend. It is apparent that the amount of power transfer be- 
tween the incident and the spurious mode depends critically on the 
separation between the two propagation constants. The sine factor in 
(94) describes the phasing between the two modes. If the incident and 
spurious modes travel with equal phase velocity, power would be 





sin?(6, — B,) z (94) 
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transferred from the incident mode to the spurious mode in proper 
phase so that all the power could be exchanged between the modes. If 
both modes have different phase velocities (and our formula holds only 
in this case), the two modes get out of step so that after some distance 
the power that is fed from the incident mode to the spurious mode tends 
to interfere destructively and destroys the power that has already been 
transferred. The power in the spurious mode thus builds up and decays. 
This process does not involve reconversion of power from the spurious 
to the incident mode, since this process is not included in our perturba- 
tion solution. 

It can be shown that a guided mode of the type (34) produces a ring- 
shaped far-field pattern if it is allowed to radiate out of the end of the 
fiber. The maximum of the ring as seen from the end of the fiber ap- 
pears at an angle 


6, = (95) 


Ky 
4 
The circular far-field pattern on a screen is broken up into 2y bright 
dots corresponding to the angular intensity maxima of the field dis- 
tribution in the fiber. The angle @, is useful to distinguish guided modes 
experimentally. It may thus be of interest to express (94) in terms of 
this mode angle. Using (87), (95), and 


Pe a Le ee (96) 
ea) Se), ea 
we can write (94) in the following form: 
Cul? Ginn __—2n*8O ig gy L. 
Cy|  6€, k4a2R2(62 — 62)8 sin’(B. — Br) 2 (97) 








For all practical applications, the separation between the angles 6, and 
6, is so small that we can replace them with one angle 6. According to 
(90) we get for the difference 

Tv 


6, — Oy = AO = + 5 


(98) 
if » =»v+1. We restrict the discussion to coupling between modes 
with the same value of N but adjacent v values. If the results of this 
discussion are implemented, (97) assumes the form 


2 = ey 26k2q4n4 


Cu 
€,é, mh 6? 


Cy 








sin? (8, — B,) z (99) 
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This equation shows that the amount of power transfer caused by 
waveguide bends decreases with increasing mode angle. 

Next we consider random bends. It has been shown in Refs. 3 and 17 
that the exchange of power among randomly coupled modes can be 
described by coupled power equations. The power coupling coefficient 
is given by 

ye = | Ky |B. = Ba). (100) 


F(6, — 8,) is the power spectrum of the coupling function f(z) or of its 
equivalents (31) or (82). Using (32) and writing 1/F instead of d?f/dz?, 
we obtain for the power coupling coefficient for random bends 


_ Sint KK z) 
hw = oe, dam hB, — Bt” (; ey 


with the power spectrum of the curvature function 
2 
): (102) 


C (G ) = (| * 1 oseu-Bedg 
R VL Jo RZ) 
The symbol ( ) indicates an ensemble average. 

The guided modes suffer radiation losses even in uniformly bent 
waveguide sections, R(z) = const.8!9 These curvature losses cannot 
be obtained by perturbation theory and thus are not included in our 
discussion. However, our perturbation theory includes mode conver- 
sion losses between guided modes and radiation losses caused by 
changes in the waveguide curvature. 

In terms of the mode angle @, (102) can be written in the form 


2 2Jn272 
a c(z): (103) 


Ce, 1 R 








The dependence on the mode angle is contained only in the power 
spectrum of the curvature function. 

With the help of the coupling coefficients (91) and (92), radiation 
losses caused by random bends can be calculated. However, the explicit 
expression will not be given here. 


VIII. MODE CONVERSION AND LOSSES DUE TO DISTORTED CORE-CLADDING 
INTERFACES 


Instead of writing down the general formula for the power coupling 
coefficient between guided modes based on (80), we restrict ourselves 
to the far-from-cutoff approximation. From (87), (95), and (100) we 
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obtain 


2 292 
Cxm 7050; 


CE pn 4a?n? F(B, = By) (104) 


(af, f(z)ei Bu-B») 2qz ’): (105) 


f(z) is the function that appears in (78). Equation (104) is remarkably 
similar to eq. (37) of Ref. 3 which was derived for the slab waveguide 
model. For comparison of the two equations, it is necessary to re- 
member that we have assumed that the angles 6, and 6, are much 
smaller than unity and that yd > 1 according to our far-from-cutoff 
approximation. The difference in the position of n in the two equations 
is attributable to the different definitions of the mode angles. In our 
present discussion, we consider 6, as the angle of the far-field radiation 
cone outside of the fiber, while this angle was defined as the angle of 
the plane waves inside of the fiber core in Ref. 3. The correspondence 
between the two coupling coefficients requires us to consider the case 
of pure diameter changes, m = 0, and assume that both modes have 
no circumferential variation, »y = 0 and » = O. In this case, we have 
€750/(4€.€0) = 1 instead of the factor 1/2 appearing in (37) of Ref. 3. 
The difference is accounted for if we remember that the round fiber 
corresponds to a slab in which both interfaces have irregularities which 
are perfectly correlated. The slab waveguide theory of Ref. 3 assumed, 
on the other hand, that the two interfaces had uncorrelated irregulari- 
ties. This comparison shows that the results of the slab waveguide 
theory and the round fiber are in very good agreement. Our present 
formula (104) holds for core-cladding interface irregularities of a much 
more general kind. Not only pure diameter changes but elliptical 
deformations and deformations of even more general shapes are 
included. 

Next, we turn to the problem of radiation losses. The power loss 
coefficient is defined by [compare (9.38-14) and (9.3-42) of Ref. 1] 








py 


with 


F(6, By) 





nok a 
w= Ef [Rel — 5) 2 ap (106) 


with K,, being the coefficient for coupling between a guided mode v 
and a radiation mode with angular symmetry u and propagation con- 
stant 6. The power spectrum F is defined by (105) with 6, replaced by 
8. Not much can be gained from substituting (81) and (82) or their 
far-from-cutoff approximations (91) and (92) into (106). The integral 
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in (106) is hard to solve and useful approximations covering the whole 
range of correlation lengths have not yet been found. However, using, 
for example, (81) and (82) in (106) simplifies the numerical integration 
compared to the problem discussed in Ref. 7. The radiation losses can 
be calculated from (81), (82), and (106) with much simpler computer 
programs and at considerable savings compared to the theory of Ref. 
7. We use an exponential correlation function 


R(u) = (fz) fle + u)) = @? exp (— |u| /B) (107) 
and obtain (Ref. 1, p. 371) 
2a? 


Bl (By + Fi i ve 


The resulting radiation losses for pure diameter changes, m = 0, are 
plotted for the HE,; mode, v = 0, in Fig. 2 as functions of the ratio of 
correlation length B over core radius a for n;/nz2 = 1.01. The curves 
were obtained by numerical integration of (106) with K,, of (81) in 
the range 0.95n2k S |8| S nok and with K,, of (82) in the range 
—0.95n2k < B < 0.95n.k. For small index differences between core 







ny/ng=1.01 
m=Q0 
v=pe=0 


10-2 10-1 1 101 102 103 
B/a 


Fig. 2—Normalized radiation losses of the HE;: mode, » = 0, as functions of the 
ratio of correlation length B to core radius a for different values of ka = 2zxa/X for 
pure diameter changes, m = 0. i1/n2 = 1.01. The dotted line results from using 
only “free space’’ radiation modes. 
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and cladding, the losses calculated from our simplified theory are in 
perfect agreement with the theory of Ref. 7. For larger index ratios, our 
approximate theory begins to fail. For ni/n2 = 1.48, the error caused 
by our approximation is in the order of 60 percent. The simplification 
gained from using (81), (82), and (106) is apparent by glancing at the 
complex formulas of Ref. 7. 

The dotted line in Fig. 2 was computed by using the radiation modes 
of homogeneous space alone so that (82) instead of a combination of 
(81) and (82) was used in (106). It is apparent that the radiation modes 
of homogeneous space are not suitable to calculate the radiation losses 
for large values of B/a. It was pointed out in Ref. 5 that large B/a 
ratios lead to forward scattering. The radiation makes small angles with 
the core-cladding interface so that reflection at this interface becomes 
important. It is thus necessary to use the radiation modes of the fiber 
for B values near nok. 

Our theory allows us to calculate radiation losses for more general 
core-cladding interface distortions. As a second example, we consider 
elliptical deformation, m = 2, and plot the result of the numerical inte- 
gration of (106) in Fig. 3. The power spectrum (108) of the function 
f(z) [defined by (78) ] was used again. We also used a combination of 
radiation modes of the fiber and of free-space radiation modes in the 
same way as indicated before. The radiation losses caused by elliptical 
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(a3/G2) a 
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Fig. 3—HE;: mode radiation losses caused by elliptical deformations, m = 2, of 
the core-cladding boundary. n1/n2 = 1.01 
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core-cladding interface irregularities are smaller than those of pure 
diameter changes. 

Actual numerical values of radiation losses obtained from curves 
like Fig. 2 and Fig. 3 were discussed in previous publications.!:>)7 


IX. CONCLUSIONS 


We have presented a simplified theory of mode coupling in imperfect 
round optical fibers. The simplification was a result of restricting the 
discussion to fibers with small values of ni;/n2 — 1. The simplified 
theory results in much simpler expressions for the guided and radiation 
modes of the fiber and consequently leads to simple expressions for the 
coupling coefficients. For small core-cladding index differences, the 
simplified theory is in excellent agreement with more general theories. 

The principal contribution of this paper is a tabulation of coupling 
coefficients for coupling between guided and radiation modes that are 
necessary for solving mode coupling problems caused by general core- 
cladding interface imperfections. A general coupling theory based on 
the modes of the ideal fiber is also presented.?° 
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Receiver Design for Digital Fiber Optic 


Communication Systems, I 


By S. D. PERSONICK 
(Manuscript received January 15, 1973) 


This paper is concerned with a systematic approach to the design of the 
“linear channel” of a repeater for a digital fiber optic communication 
system. In particular, it is concerned with how one properly chooses the 
front-end preamplifier and biasing circuitry for the photodetector; and 
‘how the required power to achieve a desired error rate varies with the bit 
rate, the received optical pulse shape, and the desired baseband-equalized 
output pulse shape. 

It ts shown that a proper front-end design incorporates a high-imped- 
ance preamplifier which tends to integrate the detector output. This must 
be followed by proper equalization in the later stages of the linear channel. 
The baseband signal-to-noise ratio 1s calculated as a function of the pre- 
amplifier parameters. Such a design provides significant reduction in the 
required optical power and/or required avalanche gain when compared to 
a design which does not integrate initially. 

It ts shown that, when the received optical pulses overlap and when the 
optical channel is behaving linearly in power, baseband equalization can 
be used to separate the pulses with a practical but significant increase in 
required optical power. This required power penalty is calculated as a func- 
tion of the input and equalized pulse shapes. 


I. INTRODUCTION 


The purpose of this paper is to provide insight into a systematic ap- 
proach to designing the “linear channel” of a repeater for a digital fiber 
optic communication system. 

In particular, we are interested in how one properly chooses the bias- 
ing circuitry for the photodetector; and how the required power to 
achieve a desired error rate varies with the bit rate, the received optical 
pulse shape, and desired baseband output pulse shape. 

Throughout this paper, performance will be measured in terms of 
signal-to-noise ratios. Efforts to calculate exact error rates and bounds 
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to error rates are difficult to carry out, and, in the past, the results of 
such efforts have shown little deviation (for practical design purposes) 
from calculations of error rates using the signal-to-noise ratio (Gaussian 
approximation) approach. (See Refs. 2 through 5 and Appendix A.) 


II, INPUT-OUTPUT RELATIONSHIPS FOR AN AVALANCHE DETECTOR 


An avalanche photodiode is the device of interest in fiber applica- 
tions for converting optical power into current for amplification 
and equalization, ultimately to produce a baseband voltage for 
regeneration. 

In order to appreciate its performance in practical optical systems, 
we have to characterize the avalanche photodiode from three points of 
view: the physical viewpoint, the circuit viewpoint, and the statistical 
viewpoint. 

When we study the device from the physical viewpoint, we ask 
how does it operate, how do we develop circuit and statistical models 
of its operation, and what are the limitations of the models. 

From the circuit viewpoint, we investigate how to design a piece 
of equipment in which the device will perform some function. 

From the statistical viewpoint, we investigate the probabilistic 
behavior of the device to allow us to quantify its performance in a 
circuit. 


2.1 The Physical Viewpoint 


The avalanche photodiode is a semiconductor device which is nor- 
mally operated in a backbiased manner-—producing a region within the 
device where there is a high field (see Fig. 1). Due to thermal agitation 
and/or the presence of incident optical power, pairs of holes and elec- 
trons can be generated at various points within the diode. These car- 
riers drift toward opposite ends of the device under the influence of the 
applied field. When a carrier passes through the high-field region, it 
may gain sufficient energy to generate one or more new pairs of holes 
and electrons through collision ionization. These new pairs can in turn 
generate additional pairs by the same mechanism. Carriers accumulate 
at opposite ends of the diode, thereby reducing the potential across the 
device until they are removed by the biasing and other circuitry in 
parallel with the diode (see Fig. 2). The chances that a carrier will gen- 
erate a new pair when passing through the high-field region depends 
upon the type of carrier (hole or electron), the material out of which 
the diode is constructed, and the voltage across the device. To the ex- 
tent that carriers do not accumulate to significantly modulate the 


DIGITAL FIBER OPTIC REPEATER I 845 


AN COLLISION 


INITIAL IONIZATIONS 
PAIR v™N 





FIELD 
Fig. 1—Avalanche detector. 


voltage across the device, it can be assumed that all ionizing collisions 
are statistically independent. This assumption also requires that the 
mean time between ionizing collisions be large compared to the time it 
takes for a carrier in the high-field region to randomize its momentum. 


2.2 The Circuit Viewpoint 


From the discussion above, and of course more detailed investiga- 
tions,*6-§ it has been concluded that a reasonable small-signal model 
of an avalanche photodiode with a biasing circuit shown in Fig. 2 is the 
equivalent circuit of Fig. 3. In Fig. 3, Ca is the junction capacitance of 
the diode’ across which voltage accumulates when charges produced 
within the device separate under the influence of the bias field. The 
current generator 7(t) represents the production of charges (holes and 
electrons) by optical and thermal generation and collision ionization in 
the diode high-field region. In order to use the photodiode efficiently, 









LARGE 


R P 
CAPACITOR a 





Fig. 2—Detector biasing circuit. 


Tt Not to be confused with the large power supply bypass capacitor of Fig. 2. 
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i(t) OUTPUT 


Fig. 3—Equivalent circuit of biased detector. 


we must design a circuit which will respond to the current 7(¢) with as 
little distortion and added noise as possible. 

In order to derive information from the circuit responding to 7(é), 
we must understand the statistical relationship between z(t) (the equiv- 
alent current generator) and the incident optical power p(t). 


2.8 The Statistical Viewpoint 


In Fig. 3, the current source z(t) can be considered to be a sequence 
of impulses corresponding to electrons generated within the photo- 
diode due to optical or thermal excitation or collision ionization. We 
shall now specify, in a statistical way, how many electrons are produced 
and when they are produced. 

From various physical studies,*:79 it has been concluded that for 
cases of current interest the electron production process can be modeled 
as shown in Fig. 4. 

Let the optical power falling upon the photon counter be p(t).? In 
response to this power and due to thermal effects, the photon counter of 
Fig. 4 produces electrons at average rate \(¢) per second where 


A(t) = [(n/hQ)p@) J + Ao, (1) 
where 
n = photon counter quantum efficiency 
hQ = energy of a photon 


ho = dark current “counts” per second. 


A(t) is only the average rate at which electrons are produced. In any 
interval 7 seconds long, the probability that exactly N counts are pro- 
duced is given by 

ANe7A 


N! ? 





PLN, (to, to i T) ] = 
where 


ee i | on" Nd. (2) 


t The reader is cautioned not to confuse p(é), the optical power, with the prob- 
ability densities (e.g., PLN, {¢,}]) in this paper. 
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Fig. 4—Model of is(¢) generation process. 


Given p(t), the number of electrons produced in any interval is sta- 
tistically independent of the number produced in any other disjoint 
interval. 

A process of impulse (electron) production satisfying (2) and the 
above independent increments condition is said to be a “‘Poisson im- 
pulse process” with arrival rate \(¢).! 

A useful equivalent description of the above process follows. 

If 7 is an interval, the probability that exactly N electrons will be 
produced at the (approximate) times f; + 3A, t2 + 3A, --- tn + 3A 
where the widths A are very small is 


N 
PLN, {t}] = {e* IL EM) AY/N!} + 0(4), (3) 
where A is defined in (2) and o0(A) is a term such that 
tn 264) 
er a an 


It is ¢mportant to note that in (3) the times {é;} are not in order, that is, 
in (3) it is noé necessarily true that t; < te, ete. 

Each of the “primary” impulses (electrons) produced by the photon 
counter enters a random multiplier where, corresponding to collision 
ionization, it is replaced by g contiguous “‘secondary”’ impulses (elec- 
trons). The number g is governed by the statistics of the internal gain 
mechanism of the photodiode. Each primary impulse (electron) is 
“multiplied” in this manner by a value g which is statistically inde- 
pendent of the value g assigned to other primaries. 

Thus the current leaving the photodiode consists of “bunches” of 
electrons, the number of electrons in the bunch being a random quan- 
tity having statistics to be described below. For applications of in- 
terest here, it will be assumed that all electrons in a bunch exit the 
photodiode at the time when the primary is produced. This implies that 
the duration of the photodiode response to a single primary hole- 
electron pair is very short compared to the response times of circuitry 
to be used with the photodiode. 
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Fig. 5—Receiver. 


Different avalanche photodiodes have different statistics governing 
the number of electrons in a bunch, i.e., the gain. For applications be- 
low, we will only need to know the mean gain (g) and the mean square 
gain (g?). For a large class of avalanche photodiodes of interest, it has 
been found that?:7 


(g?) & (g)?**, (4) 


where (g) is determined by the applied bias voltage and x, a number 
usually between 0 and 1, depends upon the materials out of which the 
diode is constructed. For germanium photodiodes, « = 1; for well- 
designed silicon photodiodes, x ~ 0.5. 


III. AN OPTICAL RECEIVER 


Figure 5 shows a fairly typical receiver, in schematic form, consisting 
of an avalanche photodiode, an amplifier, and an equalizer. 

The amplifier is modeled as an ideal high-gain infinite-impedance 
amplifier with an equivalent shunt capacitance and resistance at the 
input and with two noise sources referred to the input. For the purposes 
of this paper, the noise sources will be assumed to be white, Gaussian, 
and uncorrelated. Extensions to other amplifier models will be straight- 
forward when the techniques of this paper are understood.‘ 

It is assumed that the amplifier gain is sufficiently high so that noises 
introduced by the equalizer are negligible. 


t With this model, the noise sources of the amplifier do not change when the input 
and output load circuitry changes. 
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The power falling upon the detector will be assumed to be of the form 
of a digital pulse stream 


pt) = & buhg(t — kT), (5) 


where b; takes on one of two values for each integer value of k, 7 = the 
pulse spacing, h,(t — kT) = pulse shape and is positive for all t. We 
shall assume /-2h,(é — kT)dt = 1, therefore 6, is the energy in 
pulse k. The assumption that the received power will be in the form (5) 
appears reasonable for intensity modulation and fiber systems of 
interest.! 

From (1) we have the average detector output current (7,(t)) given 
by 

Gilt) = LLPO + Gero, 
where 
(g) = average detector internal gain 


e = electron charge 
Xo 

es 
zg PY) 


dark current electrons per second 


average optical primary electrons per second. 


Therefore, the average voltage (neglecting de components) at the 
equalizer output is 


(ooutt)) = AMDPO 4,0) shaald (6) 


66g) 
* 


where indicates convolution and A is an arbitrary constant. 


het) = F 4 -—*+_— 
BR: + jo(Ca + Ca) 


= amplifier input circuit current impulse response, 
1 A re : 
Rr=|>-+> = total detector parallel load resistance, 
Ry Ra 


and heq(t) = equalizer impulse response. 
Clearly, (vour(t)) is of the form 


(Wout(t)) = ps bihout(t > kT) (7) 
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and vout(é) is of the form 


vourlt) = 2 Dehoult — kT) + n(0), 


where n(¢) represents deviations (or noises) of vour(é) from its average. 

The fundamental task ahead is to pick R, (the bias circuit resistor) 
and heq(t) so that a system which samples vour(t) at the times {k7'} can 
make decisions as to which value b; has assumed (by comparing the 
sample to a threshold) with minimum chance of error. 


IV. CALCULATING SIGNAL—TO—-NOISE RATIO IN TERMS OF THE EQUALIZED 
PULSE SHAPE 


Having defined the receiver and its statistics in the above sections, 
we can now calculate the variance of n(t), the noise portion of the out- 
put vour(é) of the system of Fig. 5, defined as follows: 


N = ((n(t))?) = (ourlt)) — Wour(é))?. (8) 
The noise, NV, of (8) above depends upon the coefficients {b;} defined in 
(5) and upon the time t. 
We shall first of all restrict consideration to the set of timest = {kT} 
when a decision as to the values {b;} will be made by sampling vour(E). 
We shall next assume that the equalized pulses satisfy 


Nout (0) = 1 (9) 
Rour(t) = 0 for t= kT, k #0. 


That is, we shall assume that the equalized pulse stream has no inter- 
symbol interference at the sampling times k7’.t Therefore, 


Vou(kKT) = be + n(kT). (10) 


In eq. (10) the noise, n(t), still depends upon all the {b,} and the 
time ¢. This is a property which distinguishes fiber optic systems from 
many other systems where the noise is signal-independent and sta- 
tionary (not time-dependent). Consider, without loss of generality, 
the output, v(¢), at ¢ = 0. We define the worst-case noise, NW(bo), 
for each of the two possible values of bo as follows: 


NW (bv) = leno [veut (0) ) = (Vour(0))? J, (11) 


where in (11) the maximization is over all possible sets {b,} for k ¥ 0, 
and where by can take on either of two values as previously stated. The 


t The limitations imposed by this assumption are discussed in Section VII. 
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quantity NW(6o) shows, for the two possible values of bo, what the 
noise for the worst combination of the other symbols is. 

We shall next calculate (v2u:(t)) — (vour(t))? as a function of the set 
{bi}. 

Examine Fig. 5. We shall define the two-sided spectral density of the 
amplifier-current noise source 7,(t) as Sr; and the two-sided spectral 
height of the amplifier-voltage noise source e,(t) as Sz. The two-sided 
spectral density of the Johnson-current noise source 7,(t) associated 
with R, is 2k9/R,, where k is Boltzmann’s constant and @ is the ab- 
solute temperature. 

We can write the output noise as follows: 

Vout(t) — (Wout(t)) = ns(t) + mr(t) + nit) + ned), (12) 

where 

ns(t) is the output noise due to the random multiplied Poisson pro- 
cess nature of the current 7,(t) produced by the detector, 

nr(t) is the output noise due to the Johnson noise current source of 
the resistor Rs, 

nr(t) is the output noise due to the amplifier input current noise 
source 7,(¢), and 

nx(t) is the output noise due to the amplifier input voltage noise 
source é,(t). 


We have 
(uzur(t)) —_ (Wour(t) )? 
= (Your) — (Wout (t) ))?) 
= (ng(t)) + (ne(t)) + (ni(t)) + (ne(t)) 
= (nk) + (2k0/Rs) = [~ |Hao) <—G 
Ri Ra 
1 


2 


J dw 


+ jw(Ca + Ca) 


2 


dw 








1 09 
+ (81) 5: f | Hex(e) —4——_ 
ai RB: a Ra + jw(Ca + Ca) 








+ (Sx) 5 [" |Haa(w)|%de. (13) 


In (13), the last three terms were evaluated using the well-known for- 
mula for the average-squared output of a filter driven by white noise. 
We must now calculate the “shot noise” term (n3(t)). 

Recall that 7,(¢) consists of impulses of random charge corresponding 
to “bunches”’ of electrons with a random number g per bunch, this 
number being independent from bunch to bunch. 
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Consider a finite interval of duration L. Let g; be the number of elec- 
trons in bunch k in the interval; where the bunches are labeled not in 
order of time but at random. Let ¢, be the arrival time of bunch k. 
Let hr(t) be the response of the RC circuit, amplifier, equalizer com- 
bination to a current impulse from 7,(t). Then the output vour(é) just 
due to the current 7,(t) in the interval L is 


tag(t) =  eguhi(t — te), (14) 


where JN is the number of bunches. 
Recall that the probability density of N bunches at the times {¢;} is 


e~A iat A(Ex) 
PLN, {t.} ] = a ) (15) 


where 


Mt) = p(t) 5 3g + ho 


Thus combining (14) and (15) and leaving out some tedious algebra we 
obtain!® 


oh) =f ela )(p)n/AL + Aa)ha(t — edt’. (16) 


In a similar manner, we obtain 


((halt))®) — bal) =f ea") (pee) hj + da) HAC = ed, 


nterval L 


where (g) is the mean internal gain of the detector and (g?) is the mean- 
squared internal gain. 
We therefore obtain, letting L — ©, the result 


(n3() = lim C(Cobn()) — (obu(0))*I 


=[~ e2(g?) {Cz bh, (t’ — kT) 156 qt no hi(t — t’)dt’. (17) 
Further, 
1 


Hy(w) = Fihi(t — t')} = Heq(o) 5 SSSR: Inne rae eae (18) 
R, = R, sh jo(Ca + Ca) 
b A 


Thus we have the remaining term in (13) in terms of the input optical 
pulse, the equalizer response, and the RC circuit at the amplifier input. 
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Converting everything to the frequency domain and recalling that 
we have normalized the equalized output pulse h,u:(t) to unity att = 0, 
we obtain 


vo, ae) BRIS Hao Eben) 


(dx,£ 0} 








(ie) Hite) ) 


sts Ce 8+ ete) 


—o 


foo} 


Ayour(w) 


H(«) 








2 
da 








Hels) (ze + + jo(Cst C4) ) 
H ,(w) 





| , (19) 





(22/7)? . 
T 27(g)*e? Sef 


where 


H,(w) = F{h,(t)} = input power pulse transform, 
Aou(w) = F {hour(t)} = output pulse transform, 


‘x’? = convolution, 


bo = coefficient multiplying zeroth input pulse, 
and 


1 [~e] 
= f- Hour(w)deo = 1. (20) 


In principle, we wish to minimize NW(bo) by choosing Ry and H.q(w) 
for the worst-case combination of symbols {b;,}, subject to the zero 
intersymbol interference condition on the ouput pulse stream dour(t) 
[recognizing that we have normalized Aou:(t) and Hour(w) as given in 
(9) and (20) above ]. 


4.1 Comments 


(t) One observation, which follows regardless of the choice of 
Hout(w), is that the noise is always made smaller when R, is 
increased. Therefore, subject to practical constraints and for 
a fixed amplifier and a fixed desired output pulse shape (which 
is determined by the equalizer and R,), it is always best to 
make R», the bias circuit resistor, as large as possible. 

(zi) It is also clear, from (17) and the fact that the input pulse 
h,(t) is positive for all ¢, that the worst-case noise occurs when 
all the b, (except bo) assume the larger of the two possible 
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values. Recall that we are interested in the noise for both values 
of bo. 

(at) Furthermore, for a given Sz and S; and a given output pulse 
shape, it is desirable that the amplifier input resistance be as 
large as possible and that the amplifier shunt capacitance be as 
small as possible. 

(2v) It is desirable that the diode shunt capacitance be as small as 
possible. 


V. CHOOSING THE EQUALIZED PULSE SHAPE 


In principle, using (19) and given H,(w), (g), (g), Sz, Sz, Ro, Ra, 
Ca, and C’, one can find the equalized pulse shape Hout(w) for each value 
of bo that minimizes the worst-case noise. 

In practice, other considerations in addition to the noise are also of 
interest. In particular, it is important not only that the intersymbol 
interference be low at the nominal decision times k7, but that it be 
sufficiently small at times offset from {k7} to allow for timing errors 
in the sampling process. 

Therefore, rather than seeking the equalized pulse shape that mini- 
mizes the noise, we shall consider various equalized pulse shapes to see 
how the noise trades off against eye width. 

Before proceeding, it is helpful to perform some normalizations upon 
(19) to reduce the number of parameters. 

Make the following definitions: 


=] 
Rr (z. + aby ) = total detector parallel load resistance, (21) 
Ry, Ra 


Cr = Ca + Ca = total detector parallel load capacitance, 
bmax = larger value of bx, bmin = smaller value of bi, 


Hyo) = H, (7), 


27rw 


' if 
Aour(w) = 7p Hout (=F): 


In this normalization, the functions H,(w) and Hy,(w) depend only 
upon the shapes of H,(w) and Hout(w), not upon the time slot width 7. 
The previous normalizing conditions on H,(w) and Hour(w) imply con- 
ditions on H;(w) and Hou(w) 


H,(0) = 1>4H;(0) = 1 (22) 


DIGITAL FIBER OPTIC REPEATER I 855 
which implies 


/ "hates. 
Also, 


how(0) = 1 5= [” Houla)do = 19 [Heal df = 1. 
With the above normalizations, (19) becomes 


SHOT re 


NW(bs) = (> ) ie (9") 1 Kr, ra ee a 























(g)? i 
sae ae 
T ke . | Se 
+ oy: :| 8 +B + (g2)e?da +3 an 
Pm NOISES 
sp, ena) ed zis} (23) 
T((g)e)? , 
where 
an - 7 Aout(f) Howl f) 
=f" BD) FF Hii) "Hh | ao 
_ < ' Aour(k)  Hou(h) 
Dam 2) | Fey Tw | 
Be tog er GP ils 
=f" Gy | a 
_ Hoist) |* 
ar ee mGy | Pas 





In (23), the first shot-noise term is due to the pulse in the time slot 
under decision, the second term being shot noises from the other pulses 
which are assumed to be all “on.” From this normalized form of (19), 
we see that for a fixed input pulse shape and a fixed output pulse shape 
and with fixed R,, Ra, Cr, Sz, and Sz, the noise decreases as the bit 
rate, 1/7, increases (a consequence of the square-law detection) until 
the term involving J; dominates. After that, the noise increases with 
increasing bit rate (due to the shunt capacitance C7). 
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1 -aT at 
o— R — <t<—,0 OTHERWISE 
hp(t) = —— FOR — : 





1 
27 at 





hp(t) = exp {— 12/12 (at)7J} 





hplt) = <- ex {-1/ (aT) } 


Fig. 6a—Input pulse families. 


Example of Normalization: 


Suppose the input optical pulse is a rectangular pulse of unit area 
having width equal to one-half a time slot 7; then 
at De 
H,(w) = / a ettdt 


—iT 


== (3) (etot/4 — giv /4) 


. (wT /4) 


= sin a ae (24) 
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Hour!) 


sin(at/T) cos (7 Bt/T) 


B=0.1~3% Hour (t)= 


at/T [1-(28t/T)2] 





Hoyt t) 





WORST- CASE EYE DIAGRAMS 


Fig. 6b—Frequency domain, time domain, and eye diagram representations of 
raised cosine family. 


Therefore, 


; _ 2rf\ _ sin (rf/2) 
HAD) = Hy (AF) = BE. 
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As expected, the normalized pulse spectrum H,(f) is independent of 
the time slot width J and merely reflects the fact that the pulse h,(t) 
is a rectangular pulse with width equal to half a time slot. 

In order to obtain the noise for various input and output (equalized) 
pulse shapes, one needs to calculate the three integrals J,, J2, and 7; and 
the sum >>3. 

Consider the following three families of input pulse shapes (see Fig. 
6a) and single family of output pulse shapes (see Fig. 6b). 


(¢) Rectangular input pulses: 


h(t) = a ~t <t< aes 0 otherwise (25) 


sin (ar sin (af) | 


H,(f) = anf 
(72) Gaussian input pulses: 
h(t) = = e-[#2/2(aT)?} 
. V2naT 
AA f) = erase, 
(ti7) Exponential input pulses: 
helt) = ta erter 
2 aT 


’ 1 
H,(f) = T+ jonas 


(tv) “Raised cosine’ output pulses: 
ea ee wt wBt wt _ 2pt 2\ ]-1 
tout = [sin (5 eos (Fr) [EO - (Fr) )| 


(1 — 8) 
2 


=5[1-sn(F-5)]. for + <i <3 


= 0 otherwise. 


Aour(f) = 1, for 0 < | f| < 
+ B 





(Time, frequency, and eye diagram representations of the 
raised cosine family are shown as a function of 8 in Fig. 6b.14) 


In Figs. 7 through 18 calculations of 11, Zz, I3, and I, are given 
graphically for each input pulse family as a function of a and 8. 


DIGITAL FIBER OPTIC REPEATER I 859 





0 0.2 0.4 0.6 0.8 1.0 1.2 
a 


Fig. 7—Rectangular family 7 vs a and 8. 


5.1 Comments on the Numerical Results 


For the rectangular input pulses with widths between 0.1 and 1 time 
slot, I1, >-1, 22, and I3 vary very little. Thus, if one expects to receive 
rectangular optical pulses which are fixed in energy, the required en- 
ergy per pulse is insensitive to the pulse width for widths up to 1 time 
slot. 

The curves for Gaussian-shaped input pulses imply very strong sen- 
sitivity of required energy per pulse to pulse width. This is a conse- 
quence of the rapid falloff of the spectrum of a Gaussian pulse with 
frequency. It is suspected that, although for certain fiber systems the 
received pulses may appear approximately Gaussian in the time do- 
main, the frequency spectrum will not suffer such a rapid falloff. The 
results for the exponential-shaped input pulses seem much more 
realistic. 

For exponential-shaped optical pulses we notice, from Figs. 15 and 
16, that the shot noise coefficients 7; and >°; are sensitive to the optical 


4 





0 0.2 0.4 0.6 0.8 1.0 1.2 
a 


Fig. 8—Rectangular family 2, vs a and £. 
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0 0.2 0.4 0.6 0.8 1.0 1.2 
a 


Fig. 9—Rectangular family 72 vs a and £. 





0 0.2 0.4 0.6 0.8 1.0 1.2 
a 


Fig. 10—Rectangular family J; vs a and £. 


1000 


100 





0 0.2 0.4 0.6 0.8 1.0 1.2 
a 


Fig. 11—Gaussian family J; vs a and 8. 
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1000 


100 


24 





Qc. =p 0.4 0.6 0.8 1.0 1.2 
Qa 


Fig. 12—Gaussian family 2, vs a and £. 


pulse width, but that these sensitivities imply a practically useful 
tradeoff in required optical power vs allowable bit rate. That is, one 
might take a certain power penalty to allow equalization which can 
substantially increase the usable bit rate on a channel having a fixed 
optical output pulse width. The sensitivity of 7, and J; to the optical 
pulse width is similar to that of >>; and less significant because in- 


1000 


100 


Ip 





0 0.2 0.4 0.6 0.8 1.0 1.2 
a 


Fig. 13—Gaussian family 7, vs a and 8. 


862 THE BELL SYSTEM TECHNICAL JOURNAL, JULY-AUGUST 1973 





0 0.2 0.4 0.6 0.8 1.0 1.2 
a 


Fig. 14—Gaussian family I; vs a and B. 


creases in the thermal noises of the receiver are for the most part com- 
pensated for by adjustment of the avalanche gain, with only a small 
penalty in excess shot noise. The above statements will be made quan- 
titative in Section VI. 


VI. OBTAINING THE RELATIONSHIPS FOR FIXED ERROR RATE BETWEEN 
THE REQUIRED ENERGY PER PULSE, OPTIMAL AVALANCHE GAIN, 
AND OTHER PARAMETERS 


Suppose that in (23) all parameters are fixed except (g), (g?), Dmin, 
and bimax. 


B= 1.0, 0.1, 0.5 





0 0.2 0.4 0.6 0.8 1.0 1.2 
a 


Fig. 15—Exponential family J: vs a and 8. 
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o O02 0.4 0.6 08 1.0 1.2 
Qa 


Fig. 16—Exponential family 2; vs a and £. 


The receiver equalized output at the sampling time, due to an opti- 
cal pulse of energy bo, is bo. 

When b¢ = bmin, we must be sure that the probability that noise 
drives the receiver output vour(t) at the sampling time above the 
threshold D is less than 10~°.t Using the signal-to-noise ratio approxi- 
mation,? we require the noise variance, NW(imin), to be less than 
{¢LD — bmin J}? 

Therefore, we require that 

NW(bmis) S 35 (D — bin P. (26) 


Furthermore, when bo = bmax we must be sure that the probability 
that the noise drives the receiver output below the threshold is less 
than 10~°. Therefore, we require that 


NW (b max) = 3 [b max = D¥. (27) 


t An error rate of 10~° is arbitrarily chosen here. Dependence of required optical 
power on error rate is discussed in Part II of this paper. 
+ See Appendix A. 


864 THE BELL SYSTEM TECHNICAL JOURNAL, JULY-AUGUST 1973 


0 
100 
80 B= 0.1 


0.5, 1.0 





0 0.2 0.4 0.6 0.8 1.0 1.2 
a 


Fig. 17—Exponential family J, vs a and 8. 


Using equality in (26) and (27), we require for a 10~° error rate 
VN W (0 max) + VN W (min = §(bmax ae Owain) (28) 


Very often we have a fixed ratio (bmin/Dmax) = p. 
Rearranging (28) we obtain 


b= a [VNW(bmex) + VN W(pb mec) J) (29) 


In order to obtain numerical results, we shall make the following 
reasonable assumptions. Let the dark current be negligible and let 
bmin/Omax be much less than unity. Therefore we shall set A» = 0, 
bmin = 0.1 We obtain from (23) 


t Quantitative discussion of the consequences of these approximations are given 
in Part II. 
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0 0.2 0.4 0.6 0.8 1.0 1.2 
a 


Fig. 18—Exponential family J; vs a and £. 


NW (bo) & |= | \@ 56 [boli + bmax(2i1 — 11) ] + Gy val 


Za 1a[S+ oot fi n+ Se ee sel. (30) 


In (80), Z includes all the thermal noise terms of (23). 
From (29), taking the limit as p > 0 (bmin > 0), we obtain the con- 
ditions to achieve a 10~° error rate as follows. 
Case I: Thermal noise (Z) dominates (i.e., little or no avalanche 
gain). 





12%Q 


max = gy ey 
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Case IT: Optimal gain (i.e., (g) adjusted to minimize the required 
optical energy in an “‘on”’ pulse Bmax). 
Let the relationship between (g?) and (g) be specified in the usual 
way: 

Gps, (32) 
where x depends upon the type of detector. We obtain the following 
formula for the optimal gain: 

(G)optimal = (6)~NOF=(Z) M2422) (4) 1/2422) (yo) “t=, (38) 


where defining J; = >-; — J, [see eq. (23) ] 


; =[ Sie ds | (Sit 4+ PE F9 7, 


aS es Oe (34) 
yo = V1/y1 + Is + V1/y1 + Da. 


We obtain the following formula for Bmax: 


ere 


That is, 


= na (6) (2+2) I(+2)(Z)) #/(2+22) (m1) x{(2-+2z) (y2) (24-2) /(1+z) | (35) 
” 


Dinax © [Z #12422), (36) 


We therefore see that for these assumptions and x = 0.5 correspond- 
ing to a silicon avalanche detector the minimum required energy per 
pulse varies as the one-half power of the thermal noise term, Z, without 
avalanche gain, and as the one-sixth power of the thermal noise term, 
Z, with optimal gain. 

However, this does not mean that at optimal gain the value of Z is 
unimportant. By reducing Z (the thermal noise terms) through proper 
choice of biasing and amplifier circuitry, we still minimize the opti- 
mizing avalanche gain [see (33) ] and obtain some reduction in the 
required energy per pulse (see Part IT). 


6.1 Example 


From eqs. (23), (30), (34), and (85) we can calculate, for various 
shaped optical pulses, the effect of intersymbol interference on the re- 
quired energy per ‘‘on’” pulse (bmax) and therefore on the required 
average optical power needed for a 10-° error rate. We shall assume 


+ That is, if pulses are “on” half the time, the required optical power equals 


Bmax 
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Fig. 19—No avalanche gain. 


that the detector amplifier shunt resistance Rr is sufficiently large so 
that the term ((27C7)2/Te?)SzI3 dominates the thermal noise in (23) 
and (30). 

The minimal required optical power is obtained for very narrow op- 
tical input pulses.t For other pulse shapes, the excess required optical 
power can be defined as a penalty in dB for not using narrow pulses. 
This penalty is plotted in Figs. 19 and 20 for the case of no avalanche 
gain and optimal avalanche gain using the pulse shapes of (25), as- 
suming a silicon detector (2 = 0.5). In those figures, the abscissa is 
the normalized rms optical pulse width defined as follows: 


Za ( if th (tat) - ( / thg(t)at), 
T? T? ) 


where 7 = time slot width. 





(37) 


VII. CONCLUSIONS 
7.1 Conclusion on Choosing the Biasing Circuitry 


From the results of Sections IV and VI, and from (23), it is clear 
that, to minimize the thermal noise degradations introduced by the 
amplifiers following the detector, it is necessary to make the amplifier 
input resistance and the biasing circuit resistance sufficiently large so 
that the amplifier series noise source dominates the Johnson noise of 
these parallel resistances. When designing the amplifier, one should 
keep in mind that for a silicon avalanche detector the required optical 


+See Appendix B. 
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EXPONENTIAL 


GAUSSIAN 


POWER PENALTY IN DECIBELS 


RECTANGLE 





0.1 0.2 0.4 06 O08 1.0 2 4 6 8 10 
a/T 


Fig. 20—Optimal gain. 


energy per pulse at optimal gain varies roughly as the one-sixth power 
of the thermal noise variance at the receiver output, and therefore it 
is not wise to spend too much money on thermal noise reduction. On 
the other hand, if one is not using avalanche gain, the required energy 
per pulse varies roughly as the one-half power of the thermal noise 
variance at the receiver output. 

In order to minimize the effects of the thermal noise, the total capaci- 
tance shunting the detector should be as small as possible and the 
equivalent series thermal noise source of the amplifier should also be 
as small as possible. 


7.2 The Effect of Bit Rate on Required Energy Per Pulset 


The effect of bit rate on the required energy per pulse is small if the 
received pulses remain well confined to a time slot. In (23), assume J, 
> 1, Ie, and J; are fixed corresponding to a fixed received pulse width 
relative to a time slot. Then the shot noise terms due to the signal are 
independent of the bit rate 1/7, and the shot noise due to the dark 
current decreases with increasing bit rate. If the series noise from the 
amplifier dominates, then the thermal noise increases with increasing 


t This subject will be discussed in more detail in Part IT. 
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bit rate, but is for the most part compensated for by the avalanche gain 
with little penalty in required energy per pulse. 

If considerable equalization is being used, then the required energy 
per pulse increases with the bit rate (because a higher bit rate neces- 
sitates greater equalization). For the equalization assumed above, 
where the equalized pulses are forced to go to zero at all sampling times 
except one, the required energy per pulse is a strong function of the bit 
rate. For example, with exponential-shaped received pulses, the re- 
quired optical power at optimal avalanche gain was roughly 6 dB higher 
for a pulse 1 time slot wide to the 1/e point compared to a pulse only 
0.25 time slot wide to the 1/e point (see Fig. 20). 

On the other hand, it is clear that zero-forcing-type equalization is 
not optimal, particularly for received pulses whose spectra fall off 
rapidly with frequency. It is more likely that some compromise between 
eye opening and output noise variance results in minimum required 
energy per pulse. 

For the assumed zero-forcing equalization, we still can conclude that 
a usable tradeoff exists between required energy per pulse and bit 
rate, and this will allow some extension of the usable rate on “disper- 
sion-limited’’ fibers. 


7.3 Comments on Previous Work 


The purpose of this paper has been to illustrate the application of 
the “high-impedance” front-end design to optical digital repeaters, to 
take into account precisely the input pulse shape and the equalizer- 
filter shape, and to obtain explicit formulas for the required optical 
power to achieve a desired error rate as a function of the other 
parameters. 

Previous authors!?:!* working in the areas of particle counting and 
video amplifier design have recognized that a high-impedance front 
end followed by proper equalization in later stages provides low noise 
and adequate bandwidth. However, optical communication theor- 
ists®.§-14.15 have in the past often used the criterion “RC S T’’-loading 
down the front-end amplifier so as to have adequate bandwidth with- 
out equalization—therein incurring an unnecessary noise penalty. Some 
optical experimenters!*!”7 have recognized the high-impedance design 
for observing isolated pulses or single frequencies, but failed to recog- 
nize the use of equalization. 

Many previous authors**®!5 have used simple formulas (which 
usually assume isolated rectangular input pulses and a front-end band- 
width of the’ reciprocal pulse width) to obtain the required power in 
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optical communication systems for a desired signal-to-noise ratio. 
Often these formulas average out the signal-dependent nature of the 
shot noise. If modified to include the high-impedance design concept, 
such formulas are very useful for obtaining ‘‘ball park” estimates of 
optical power requirements. Such formulas are, in general, special 
cases of the formulas described here. 


7.4 Experimental Verificatcon 


In work recently reported,!* J. E. Goell has shown that, in a 6.3- 
Mb/s repeater operating at an error rate of 10-°, agreement of experi- 
mentally determined power requirements and the above theory were 
within 1 dB (0.25 dB in cases without avalanche gain). In particular, 
using an FET front end and the “high-impedance” design, the optical 
power requirement without avalanche gain was 8 dB less than with the 
front end loaded down to the “RC = T”’ design. 


APPENDIX A 
Signal-to-Noise Ratio Approximation 


In this paper we have calculated the mean voltage (Omax or Dmin) 
and the average-squared deviation from the mean voltage (NW (bmax) 
or NW(bmin)) at the receiver output at the sampling times. In order to 
calculate error rates, we shall assume that the output voltage is ap- 
proximately a Gaussian random variable. This is the signal-to-noise 
ratio approximation. Thus if the threshold, to which we compare the 
output voltage, is D, and if the desired error probability is P., we have 








= | : eet es Cee eo hear on om (38) 
TO5 
where 
o2 = NW(min) 

and 

1 e 9 

Tal exp L-( = Dmax)?/207 dv = P,, 

TO. —7 

where 


oa = NW (0 max)- 


Changing the variables of integration we obtain the following ex- 
pressions, equivalent to (38): 


/ " e-#dz = P,, (39) 
Q 


=> 
3 
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Q 
4 
3 
2 
1 
1072 10-3 10-4 10-5 ~=— 10-6 10-7 10-8 = 10-9 10-19 49-11 49-12 
P. 
1 wo 
Water. Ppa i ead, 
o 8" S28 Ja 
where 
Q = (D _ Dinin) / Oo 
and also 


Q = (Omax = D)/c1. 
Thus we must have 


oo = VNW(bmin) 


(D = bmin)/Q 
and 


= VN W (8 max = (bmax a D)/Q. 
Therefore we must also have (eliminating D) 


VN W (8 max) ++ VN W (6 min 4 (bmax = bmin)/Q. 


The value of Q is determined by the error rate through (89) above. 
Figure 21 shows a plot of Q vs P. which can be obtained from standard 
tables. 

Equation (39) states that the threshold must be Q standard devia- 
tions (of the noise at bmin) above bmin, and also must be Q standard 
deviations (of the noise at bmax) below bmax to insure the desired error 
rate. For an error rate of 10~° (P. = 107°) Q is roughly 6 (5.99781). 
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APPENDIX B 
Optimal Input Pulse Shape 


We now wish to show that the optimal input pulse, h,(¢) shape 
(which minimizes the required average optical power) is ideally an 
impulse; and for practical purposes a pulse which is sufficiently narrow 
so that its Fourier transform is almost constant for all frequencies 
passed by the receiver. To do this, we shall show that such a narrow 
pulse minimizes the noises NW(bmax) and NW(bmin) defined in (23). 

We begin with the already established condition that the area of 
h,(t) is equal to unity and that h,(t) is positive (power must, of course, 
be positive). 


i MOe= Ts he) >S 6. (40) 
These conditions imply the following weaker condition: 


|Hp(f)| = 





[roWerseerat 





< [ih ilemrrlat = fag Qdt = (0) = 1. (41) 


Consider first the thermal noise terms of (23) involving the integrals 
I, and Is: 


_ f LHou( A)? 4 ¢. ee clo e) eae 
sees 070) cama a7 ie 

Using (41) in (42) we see that these terms J, and J; are minimized for 
any desired output pulse Hou(f) by setting |H,(f)| = H>(0) = 1 for 
all frequencies, f, for which | Hju(f)| > 0. Thus, ideally, to minimize 
I, and I3, h,(t) is an impulse of unit area which also satisfies the condi- 
tions (40). 

We must now show that the shot noise terms of (23), J, and >>; — Nh, 
are minimized by a very narrow pulse h,(t). 

First recall that (301 — J1)0max(#2/n)(g?)/(g)?_ is the worst-case, 
mean-square shot noise at the sampling time due to all other pulses 
except the one under decision, and assuming all of those pulses are ‘‘on”’ 
(by = Bmax for k ¥ 0). Thus, from (17), we obtain 


d1i—- 11,2 0, (43) 
where 


yi —- Th = {oro — kT))hi(-t))dt' 
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and where /;(t) is the overall receiver impulse response relating h(t) 
to hour(t). 

Now let the optical pulse h,(t) be an impulse of unit area. Then the 
overall impulse response hy; must be equal to hour(t) and therefore using 
(43) and (9) we obtain 


D1 -h= x heut( —kT) = 0 
(for h,(t) = 6(t)). 


(44) 


Because condition (9) requires zero-crossing equalization, we have 
shown that an impulse shape for h,(t) minimizes (removes) the shot 
noise from pulses other than the one under decision. 

Finally, consider the shot noise from the pulse under decision given 
by 11(42/n)b 0(g?)/{g)? where 


1 = [ hg(t)na(—t)at’ > 0. (45) 
We already have the condition (9) 
hous(0) = f hg(t’)ha(—t)at! = 1. (46) 
We can next use the Shwarz inequality on (46) 
(how())* = 1 = ( fHBMDAA—Hat) 
< [hy(tat f hy(eyne(—dat. (47) 


Since fh,(t)dt = 1, we have from (47) and (45) 
I, 21. (48) 


Now set h,(t) equal to a unit area impulse. It then must follow from 
(46) that h7(0) = 1. We finally obtain 


/ hp(t)h3(t)dt = h3(0) = 1. (49) 


From (48) and (49) we see that an impulse-shaped h,(t) makes J, 
achieve its minimum value of unity. 

Summarizing, an impulse-shaped optical input pulse h,(t) (for prac- 
tical purposes a sufficiently narrow pulse so that its Fourier transform 
is approximately constant for all frequencies passed by the receiver) 
minimizes all the pulse-shape-dependent coefficients (11, }01 — 1, Ie, 
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and J;) in the noise expression (23) and thereby minimizes the required 
average optical power to achieve a desired error rate (using the signal- 
to-noise ratio approximation of Appendix A). 
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Receiver Design for Digital Fiber Optic 


Communication Systems, II 


By S. D. PERSONICK 
(Manuscript received January 15, 1973) 


This paper applies the results of Part I to specific receivers in order to 
obtain numerical results. The general explicit formulas for the required 
optical average power to achieve a desired error rate are summarized. A 
specific receiver is considered and the optical power requirements solved 
for. The parameters defining this receiver (e.g., bit rate, bias resistance, 
dark current, etc.) are then varied, and the effects on the required optical 
power are plotted. 


I, INTRODUCTION 


This paper will apply the theory of Part I to illustrate in detail how 
the required received optical power in a digital fiber optic repeater 
varies with the parameters such as the desired error rate, the thermal 
noise sources, the bit rate, detector dark current, imperfect modula- 
tion, etc. We shall begin by first applying the formulas of Part I to 
a specific realistic example to obtain reference point. We shall then 
derive curves of how the required power varies around this point as we 
vary the system parameters. 


II. REVIEW OF RESULTS OF PART I 


In Part I we derived explicit formulas for the required optical power 
at the input of a digital fiber optic communication system repeater to 
achieve a desired error rate. One formula was applicable when little 
or no internal (avalanche) detector gain was used, so that thermal noise 
from the amplifier dominated. The other formula was applicable when 
optimal gain was being used. These formulas are repeated below: 


Prequired = re (Thermal Noise Dominates) (1) 
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where 
= P 2ké Sr (21C 7) 2Sals P 
z= {Ese e+ Be | + CeO Sel (1a) 
1 (2+2) /(1+2) {(2+2z) /(2+22) 2+2) /(1+z) AQ 
Prequired = oT (Q) 2 ‘: [(Z)* * (y1)” " (Y2)¢ mY CES aS ’ 


(Optimal Avalanche Gain) (2) 
where 


Goptimal = (Q)~1 +2) F(Z) 1 G+22) (41) 1/2422) (qo) U2) 7], 
where (referring to Fig. 1) 


n/h® = detector quantum efficiency/energy in a photon 
T = interval between bits = 1/bit rate 
G = average detector internal gain 
G* = detector random internal gain excess noise factor 
@ = number of noise standard deviations between signal and 
threshold at receiver output. Q = 6 for an error rate of 107°. 
(See Fig. 21 in Part I for a graph of error rate vs Q.) 
e = electron charge 
ké = Boltzman’s constant-the absolute temperature 
Rr = total parallel resistance in shunt with the detector including 
the physical biasing resistor and the amplifier input resis- 
tance 
Ry» = value of physical detector biasing resistor 
Cr = total shunt capacitance across the detector including the 
shunt capacitance of the detector and that of the amplifier 
Sr = amplifier shunt noise source spectral height (two-sided) in 
amperes?/Hz 
Sz = amplifier series noise source spectral height (two-sided) in 
volts?/Hz. 


Ie, I3, ¥1, and yz are functions only of the shapes of the input optical 
pulses and the equalized repeater output pulses, where the length of 
a time slot has been scaled out. These functions are defined in eqs. (23) 
and (84) of Part I. 

Formulas (1) and (2) neglect dark current and assume perfect modu- 
lation (received optical pulses completely on or off). We shall investi- 
gate deviations from these idealizations later in the paper. For silicon 
detectors and bit rates above a few megabits per second, these idealiza- 
tions are reasonable approximations. 
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Fig. 1—Receiver. 


III. A TYPICAL OPTICAL REPEATER 


Consider the following practical optical repeater, operating at a bit 
rate of 2.5 X 107 bits per second and an error rate of 10~°. The detector 
is a silicon device with excess noise exponent x = 0.5, quantum effi- 
ciency 75 percent, dark current before avalanche gain of 100 pico- 
amperes, and an operating wavelength of 8500 angstroms. The front- 
end amplifier is a field-effect transistor in a common-source configura- 
tion. The total shunt capacitance across the detector is 10 pF. The de- 
tector biasing resistor is 1 megohm. The amplifier input resistance is | 
megohm. The amplifier shunt-current noise-source spectral height is 
equal to the thermal noise of a 1-megohm resistor. The amplifier series- 
voltage noise-source spectral height is equal to the thermal noise of 
a conductance with a value equal to the transistor transconductance, 
Jm, Which is 5000 micromhos. The received optical pulses are half-duty- 
cycle rectangular pulses. The desired equalized output pulse is a raised 
cosine pulse [see Part I, eq. (25) ] with parameter 8 = 1. 

We must first calculate the value of Q which depends only upon the 
desired error rate. From Part I, Fig. 21, we see that for an error rate of 
10-°, Q = 6. 

Next we must obtain the constants 2, I3, yi, and y2. These depend 
only upon the input optical pulse shape and the equalized output pulse 
shape. From (23) and (84) of Part I we obtain 


I, = 0.804046, I; = 0.071966, v1 = 21.4106, 


ye = 1.25424. (3) 
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Using the above data we obtain the thermal noise parameter Z as 
follows: 


: 2k6 


7 
4 x 1078 a —21 —6 —6 
v 


é 
I» 
4x 10°” 


T 5 xX 107° 


Vi 
) 0.804046 


Is 
—21 wv 
(2x X 10-4)? ( aver ae ) (0.071966) 


= 5 
(1.6 X 10-")2(4 X 10-8) 4.8027 X 105. (4) 


+ 


From the data we have #0/yn = 3.117 X 107° joules. 

We obtain from (1), at unity internal gain (no avalanche), Drequirea 
= 3.25 X 10-8 watts = —44.89 dBm (no gain). 

We obtain from (2), at optimal avalanche gain, Drequirea = 1.6409 
X 10-9 watts = —57.85 dBm, Goptimar = 56.89. 


20 


= 
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NO GAIN 


= 
oO 


RC = T/2ar 
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10 102 103 104 10° 106 10? 
BIAS RESISTOR, R,, IN OHMS 


Fig. 2—Required power penalty vs Ry. 
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We therefore observe that optimal avalanche gain buys a 13-dB 
reduction in required optical power in this example. Before proceeding, 
we can check the validity of neglecting dark current. The average num- 
ber of primary photoelectrons produced by the signal per pulse inter- 
val 7 is the required optical power multiplied by 77'/%2. When shot 
noise is important (with avalanche gain) this number is 210 primary 
signal counts per interval 7. The number of dark current counts per 
interval T is the dark current in amperes multiplied by 7’'/e, which in 
this example is 25 primary dark current counts. Thus, the shot noise 
due to the dark current is about 10 percent of the signal shot noise. 
It is therefore a reasonable approximation to neglect this dark current 
noise. In Section VI we shall calculate precisely the effect of dark cur- 
rent upon the required optical power. 


IV. VARYING THE PARAMETERS 


In this section, we shall calculate the effect of varying parameter 
values used in the example of Section III. 


4.1 Biasing Resistor Value 


It was pointed out in Part I that the biasing resistor R, should be 
sufficiently large so that the amplifier series noise source Sz dominates 
in the expression for Z of (1a). This was in fact the case in the example 
of Section III. We can calculate the penalty in required optical power 
for using a smaller biasing resistor. This penalty is plotted in Fig. 2 in 
dB with zero dB being the penalty associated with an infinitely large 
biasing resistor. The exact penalty of Fig. 2 is applicable with the other 
relevant parameters (which make up Z) given in the example above. 
However, the qualitative conclusions are that significantly more optical 
power is needed if one adheres to the ““RC = T”’ design rather than the 
‘large R’’ (high-impedance) design, in the absence of avalanche gain. 
Figure 3 shows how the optimal avalanche gain varies when R, is 
changed. The qualitative conclusion is that the “RC = T”’ design re- 
quires significantly more avalanche gain that the “large R” (high- 
impedance) design. It should be pointed out that, for lower bit rates 
and/or a smaller capacitance C7, the improvement associated with use 
of a large R, rather than a value to keep RiCr = T is more pronounced. 


4.2 Desired Error Rate 


As mentioned before, the error rate is coupled to the parameter Q in 
(1) and (2). Figures 4a and 4b show plots of the variation in the re- 
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G/Goo 
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Fig. 3—Optimal gain penalty vs Ry, G. = optimal gain at Rp = o. 


quired power in dB with the desired error rate without gain and with 
optimal gain. The absolute power in dBm is only applicable to the ex- 
ample of Section III above. However, the difference in required power 
in dB between any two error rates is applicable in general, as should be 
apparent from (1) and (2), provided'a silicon detector (x = 0.5) is 
being used. 


4.3 Bit Rate (1/T) 


As mentioned before, the pulse spacing 7’ is scaled out of Is, Z3, y1, 
and ye. These numbers depend only upon the input and output pulse 
shapes (e.g., half-duty-cycle rectangular input pulse, raised-cosine 
equalized output pulse). Therefore, the effect of the parameter T is 
explicitly given in (1) and (2) without any hidden dependencies. (This 
of course assumes that the input pulse shape is not limited by disper- 
sion in the transmission medium and can therefore be held to a half- 
duty-cycle rectangle.) If we assume that the high-impedance design is 
being used and that this dominance of the term proportional to 1/7 
in Z of (1a) can be maintained as the bit rate is varied (becomes difficult 
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Fig. 4—(a) Required power vs error rate (no avalanche gain). (b) Required power 
vs error rate (optimal gain). 


at low bit rates), then we have the following dependence of the re- 
quired optical power upon the bit rate 1/7 without gain and with opti- 
mal gain: 


Prequired a has (no gain) (5a) 
(4.5 dB/octave of bit rate) 
; —7/6 ; 1 ; 
Prequirea & T" (optimal silicon gain) (5b) 


(3.5 dB/octave of bit rate) 
Gastiun a Tt 
(1 dB/octave of bit rate). 


One should be careful extrapolating (5a) and (5b) to very low bit 
rates. First, the shot noise is no longer negligible compared to the 
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thermal noise at bit rates where the optimal gain is low. Thus (5a) 
loses validity at very low bit rates. Further, (5b) is only valid for opti- 
mal gains greater than unity. Near unity optimal gain, the silicon 
excess noise factor departs from G:*. In addition, at low bit rates, dark 
current may not be negligible. It is reasonable to use (5a) and (5b) to 
extrapolate the results of the example in Section III to bit rates be- 
tween 5 and 300 Mb/s. 


Vv. THE EFFECT OF IMPERFECT MODULATION 


The above formulas (1) and (2) assume that there is perfect modula- 
tion. That is, it was assumed that each optical pulse is either completely 
on or completely off. In this section we shall investigate two versions of 
imperfect modulation. 


Case 1: Pulses Not Completely Extinguished 


This case is illustrated in Fig. 5. In each time slot the optical pulse 
is either completely or partly on. The partly on pulse has the same 
shape as a completely on pulse, but has area EXT times the area of 
a completely on pulse. This may correspond to an externally modulated 
mode-locked laser source. Thus the ratio of the power received when a 
sequence of all “‘off”’ pulses is transmitted to the power received when a 
sequence of all “‘on’”’ pulses is transmitted is EXT’. Using the results 
of Part I eqs. (23) and (84), we obtain the following power require- 
ments which are modifications of (1) and (2) above: 


— _ (Lt EXT \ QZ? nO . : 
Drequired = Ges = ayT) GT a” (Thermal Noise Dominates) (6) 


1+ EXT Q (2-2) /(1+2) 


4 [(Z) 2/2422) (a7) 2/2422) (9) 22) /(t+2) - (Optimal Gain) (7) 





where defining from Part I (23) and (384) 
I oe Dar > (1 = EXT)I, 


we have 
Ot + fli +19 + MEE, 
Nn = 9 X14 ie 


y2 = VI/yi + Da + VI/1i + Te. 
[Compare (6) and (7) to (1) and (2). ] 
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Fig. 5—Imperfect modulation, pulses not completely extinguished. 


Using (1), (2), (7), and (8), we can calculate the extra required opti- 
cal power due to a nonzero value of EXT with and without avalanche 
gain. When avalanche gain is being used, this power penalty depends 
upon the input and output pulse shapes. We plot in Fig. 6 the power 
penalty vs EXT, assuming the pulse shapes of the example in Section 
III above for the avalanche gain case. 


Case 2: Pulses on a Pedestal: 


This case is illustrated in Fig. 7. The received optical pulses arrive on 
a pedestal, which may correspond to inability to completely extin- 
guish the light from a modulated source which is not in a pulsing (mode- 
locked) condition. We set the ratio of average received optical power 
when all pulses are “off” to average received optical power when all 
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Fig. 6—EXT penalty (dB) vs EXT. 
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Fig. 7—Imperfect modulation, pulses on pedestal. 


pulses are “fon” to be EXT in analogy to Case 1 above. This ratio will 
remain fixed if the pulse changes in propagation from transmitter to 
receiver. Using the results of Part I we obtain the following formulas 
for the required optical power: 


= LE BT OZ 10 
Prequired —_ 1 as EXT GT ” ) 


1+ EXT (Q)@t# 10+ 
Pe 7 ERE BT 
AQ 


x Arne ga) eer de : (Optimal Gain) (9) 


(Thermal Noise Dominates) (8) 


where defining from Part I (23) and (84) 


' EXT 
D1 = D1 ar (ear) I, 


EXT 
= Ei h+ (7—aep) 


“(Lit + (cit t+ BEL won 


- Ax 
yo = VI/yi + Si + VI/at + Tr. 


Once again we can use (1), (2), (8), and (9) to calculate the penalty for 
nonzero extinction. This penalty is plotted in Fig. 8 vs HXT where we 
assume the input and output pulse shapes of the example in Section III 
when there is optimal avalanche gain. 


we have 


VI. THE EFFECT OF DARK CURRENT 


In order to allow for dark current, we must solve the following set of 
simultaneous equations which treat the dark current as an equivalent 
pedestal-type nonzero extinction. (When thermal noise dominates, 
dark current is either negligible or its shot noise can be added trivially 
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to the amplifier parallel current noise source Sy.) 
Disc ss (Z) 2/2422) (y1") 21 (2422) (4y2") (242) (+2) = . 
(At optimal avalanche gain) (10) 
where defining from Part I (23) and (34) 
hi = Lit ole 
Ig = Yi -ht+ 6h, 


+) + fcer ty + BEES yy, 


we have 


el 2dr Te 
yo = VA/y + DY + VI/yi' + Ts 
‘ 6 
d= (2D required) 7 ’ (11) 


where 7g = primary dark current in amperes. 
There are various ways to solve (10) and (11) simultaneously. One 
way is to solve (10) first with 6 = 0 for Prequirea- Then one can solve 


OPTIMAL SILICON GAIN 


PENALTY IN DECIBELS 


THERMAL NOISE LIMITED 
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Fig. 8—Power penalty vs EXT. 
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Fig. 9—Required power vs dark current. 


(11) for a new value of 6. One then resolves (10) and then (11), ete.— 
repeating the iterations until satisfactory convergence is obtained. 
Figure 9 shows a plot of the required power in dBm for the example in 
Section III vs dark current in nanoamperes. We see that a dark current 
of 100 picoamperes results in an optical power requirement which is 
about 0.5 dB more than that which would be required with zero dark 
current. Thus it was reasonable to neglect dark current when calculat- 
ing the required power in Section III. Dark current will result in even 
less of a penalty at higher bit rates. Although the curve of Fig. 9 is ap- 
plied to the specific example of Section III, it is apparent in general 
that, at bit rates above a few megabits per second and with primary 
dark currents less than 0.1 nanoampere, dark current will have a small 
effect upon the required optical power. 
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Gain-Induced Modes in Planar Structures 


By W. 0. SCHLOSSER 
(Manuscript received January 31, 1973) 


The properties of modes in a slab structure with gain in the center region 
and loss in the surrounding medium are investigated. The propagation 
constant and field distribution of the lowest-order modes are determined. 
The cutoff frequencies and propagation constants of the next-higher mode 
are given. Furthermore, the effect of a refractive index depression or in- 
crease in the center region is determined. The depression does not destroy 
the mode, as may be expected, but causes it to have a cutoff frequency. 
Comparison of these results to the experimental data shows that gain-induced 
modes play an important part in the lateral confinement in _ stripe- 
geometry GaAs lasers. 


I. INTRODUCTION 


Modes in cylindrical structures with refractive index boundaries are 
well known. Their existence and basic properties are generally visual- 
ized by superposition of plane waves reflected at the index boundaries. 
Much less is known about modes in structures with spatially non- 
uniform gain or attenuation.!;? However, it seems intuitively possible 
that some kind of mode should also exist in that case; indeed, Kogelnik® 
showed that a cylindrical structure with a radial gain profile can sup- 
port a Gaussian beam of constant diameter even if there are no refrac- 
tive index differences present. Evidently, the nonuniform transparency 
of the medium counteracts the natural tendency of the beam to spread. 

In this paper we will consider a planar structure with stepwise dis- 
continuities of gain or absorption. This geometry is of considerable 
practical interest. As will be shown in this paper, the lateral confine- 
ment of the optical field in a stripe-geometry GaAs laser is due to the 
gain-loss interface at the edge of the stripe. Furthermore, it is easy to 
create nonuniform gain distributions in planar structures either by 
masking and optically pumping or by nonuniform injection current 
distribution. The modes induced by these gain distributions can be 
easily influenced from the outside by changing the pumping intensity 
or injection current. 

887 
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II. GAIN-INDUCED MODES IN A THREE-LAYER STRUCTURE 


The physical configuration with which we will concern ourselves in 
this paper is shown in Fig. 1. All material parameters are assumed to 
be constant throughout the regions and differ only at the boundaries 
x = -ta. As is well known from the theory of refractive guiding,‘ the 
modes can be divided into four classes: TE even and odd in z direction 
and TM even and odd in x. The two lowest-order even modes have no 
cutoff frequency, whereas the odd modes can be guided only above a 
certain cutoff frequency. For our application, the lowest-order even 
modes are of greatest interest and we will deal with them first. 

Before going into details, let us explain the notation which we will 
use subsequently. 


(4) The complex relative dielectric constant ¢ is split into refractive 
index n and extinction coefficient k according to 


e = (n+ jk). (1) 


(7) uw determines the x dependence of the fields inside the center 
layer |x| S a, ie., they have the functional form 


sin ( =) 
UuU-— . 
cos \ a 
(17) w gives the dependence of the fields in the cladding |x| = a 
which follow the function e~¥!*/¢!. u and w are related by 
u? + w? = (ex = €2)(koa)?. (2) 


With these quantities we can derive the characteristic equations (the 





Fig. 1—Cross section of slab waveguide. 
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field components are listed in Appendix A) for the even modes. 
TE Mode TM Mode 


w= utanu Sw = utanu (3) 
2 


They are formally the same as in the case of purely reactive modes but, 
since €1 and e2 are complex, the solutions will be complex. 

We determine under which conditions eqs. (3) have a solution. We 
restrict ourselves to cases where the fields cannot increase exponentially 
with increasing distance from the interface nor can there be any move- 
ment of wavefronts from infinity toward the guiding structure. This 
limits w to the first quadrant. It is then easy to derive a necessary but 
not sufficient condition from eqs. (2) and (8): e1 — e2 has to stay in the 
first or second quadrant, i.e., Im (€1 — e2) > 0, which is equivalent to 


Niky — nek, > 0. (4) 
Let us interpret this inequality for some special cases: 


(1) ky = ke. Condition (4) reduces to m1 > nz which is the well- 
known requirement for refractive guiding. 

(it) nm, = ne. The condition (4) now reads ki > ke, equivalent to 
saying that the center region should be more transparent than 
the sides, which agrees with physical intuition. 

(117) my = ne + An, where An is small compared to nz but not neces- 
sarily small compared to k. In this case (4) can be expressed by 


Ak , An 

ky ae Ne > 0, (4a) 
where Ak = k; — ke. This inequality shows that there can be modes 
even if An or Ak is negative, as long as the other mechanism is strong 
enough to generate the mode. However, since (4) is only a necessary 
condition, this case requires further examination. . 

The second problem we have to address ourselves to is that of the 
cutoff frequency. For refractive guidance the lowest-order even modes 
have no cutoff frequency. We will now establish the conditions under 
which gain-induced modes have no cutoff frequency either. For small 
values of (€1 — €2)(koa)?, w and u will be small and the tangent in 
eq. (3) can be replaced by its argument. The solution of eqs. (2) 
and (3) is thus w = (e1 — €2) (ako)? for TE modes and w = (e2/e1) 
X (€1 — €2) (ako)? for TM modes with ako very small. This result 
shows that the solution of (3) exists no matter how small (ako) is, 
independent of the guide parameters, as long as «1 — ez is in the 
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first quadrant. In other words, a gain-induced mode will have no cutoff 
frequency as long as there is a refractive index difference to keep e1 — €2 
in the first quadrant. If there exists a refractive index depression, 1.€., 
Re (e1 — e€2) < 0, the modes do have a cutoff frequency. 

We will now consider the case of gain-induced modes without a refrac- 
tive index change. We specialize the discussion to small extinction co- 
efficients in the order of 10~‘, whereas n is typically greater than one. 
This covers typical laser applications reasonably well. The character- 
istic equations for TE and TM modes [eas. (8) ] differ only by a factor 
of e:/e2 = 1 + j(2/n)(ki — ke) which is sufficiently close to one to 
cause only a negligible perturbation. In the following, we will therefore 
neglect the difference between TE and TM modes. 

To simplify the discussion we define two new quantities: A normal- 
ized frequency is given by 


= akoVe: — e2. (5) 
For gain-induced modes, v reduces to the form 
v = akon 4/72 ————— (hs = = fa) (6) 


which means that in this particular case the phase angle of v is inde- 
pendent of the material parameters. It is furthermore customary® to 
use a normalized propagation constant b 
p = Palholt Se (2) a cle Shae: (7) 

€y — €2 
Since in our application e1 — e2 is a small quantity, (6./ko)? is always 
equal to e2 plus a small perturbation, whereas the variation of |b| is in 
the order of one, thus alleviating some computational problems. The 
two parameters u and w are related to v and b by u = vvl — b and 
w = ovo. 

With these new quantities, the two characteristic eqs. (3) are trans- 


formed into 
vb = V1 — btan (vv1 — 3B). (8) 


It should be noted that the normalized propagation constant 6 is 
exclusively a function of v. It is, therefore, only necessary to solve the 
characteristic eq. (8) once to cover all material parameters and di- 
mensions, assuming, of course, the validity of the initial assumptions. 

The solution of the characteristic equation was done on the com- 
puter. Figure 2 shows the normalized propagation constant b for the 
lowest-order mode as a function of |v|. We can interpret b more easily 
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0 0.2 0.4 0.6 0.8 1.0 1.2 1.4 1.6 1.8 2.0 2.2 2.4 


Fig. 2—b(|v|) for lowest-order even mode (nm = nz). 


if we rewrite the propagation constant 8. [eq. (7) ] to show its real and 
imaginary parts. 


= = n — billy — ke) + j(ko(1 — b-) + kbs). (9) 


The attenuation or gain of the mode as represented by the imaginary 
part of 8./ko is solely dependent on b,, which varies between 0 and 1. 
For small values of b,, the attenuation will be determined by the outer 
medium and only for higher values of 6, will the gain in the center 
region be of significance. This situation corresponds very closely to the 
case of refractive guidance. For further reference we have included in 
Fig. 3 the field distribution with |v| as parameter. 

For any practical application, it is important to know up to which 
frequency or dimension the guide is single moded. We will therefore 
determine the cutoff frequency of the lowest-order odd mode and of the 
first-order even mode. We define the cutoff value of b as the one at 
which the radiation condition just ceases to be fulfilled. It turns out 
that this is the case when w is purely imaginary, i.e., only power is 
radiated away from the guiding structure, but the amplitude does not 
decrease with increasing |x|. Re (w) = 0 is equivalent with the condi- 
tion Re (6) = 0 [eq. (7) ]. The solution of this problem has to be found 
on the computer. The results are 


lvoaa| = 1.877 | veven| = 2.759. 


For future reference we have included b(|v|) for the first-order even and 
the lowest-order odd mode (Figs. 4 and 5). 
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Fig. 3—Field intensity for lowest-order even mode. 


Ill. THE INFLUENCE OF ADDITIONAL REFRACTIVE INDEX DIFFERENCES 


In general, if there is a gain or loss difference between two regions, 
the refractive index will be different as well. Gain and refractive index 
are related by the Kramers-Kronig relations and thus the presence of 
gain can change the refractive index. Furthermore, in the case of the 
injection laser, the injected carriers change the refractive index due to 
the plasma resonance. In this particular case, the gain region can be ex- 
pected to have a lower refractive index than the side regions. It is thus 
necessary to explore the effects of refractive index increases and depres- 
sions in the center region in conjunction with gain-loss differences. 
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Fig. 4—b(|v|) for first-order even mode. 
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Fig. 5—b(|v|) for lowest-order odd mode. 
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If there is an increase in refractive index in the center, we can expect 
the confinement of the energy to increase with increasing index differ- 
ence. However, in the case of a refractive index depression in the center, 
the two confining mechanisms will counteract each other. 

Let us consider the effect of increased refractive index in detail. In 
contrast to the case of gain-induced modes where the phase of » was 
a/4 [eq. (6) | independent of material parameters, it is now dependent 
on the complex difference ¢; — ez. This precludes a representation of 
the guiding parameters independently of the waveguide parameters as 
it was possible in the previous case. We are thus forced to determine 
a solution of the characteristic equation for each set of parameters. It 
is therefore of importance to develop approximations for the propaga- 
tion constant. We treat first the case, Ak/An « 1. We regard the wave- 
guide as a perturbed refractive guide. The b(v) characteristic is plotted 
in Fig. 6. Obviously, both v and 6 are real. A small perturbation Ak has 
two effects: (z) the characteristic eq. (8) becomes complex and yields 
an imaginary part b;, whereas the real part is to a first order unper- 
turbed; and (77) the propagation constant (9) is now approximated by 

Bz 


i =n+ Anb, = b; Ak + i(ke + b, Ak xh b; An), (10) 
0 


i.e., there appears a second imaginary component 6; An, which acts 
like a gain since An and 0; are both positive. This effect is due to the 
improvement in guiding by the gain-loss mechanism. The calculations 
are straightforward and are listed in Appendix B. We note here only 
the result: 

Face Ak (1 — bo)voVbo 


i= 11 
An 1 + vvVbo ay 


where the quantities indexed with a zero denote the unperturbed state. 
In particular, v9 = akonV2An/n. We notice that b; contains a factor 
1 — bo, which decreases with bo approaching unity. Inserting (11) into 
eq. (10) yields an expression for the loss (or gain) of the mode: 


(12) 


a = ko [ia + ak (bo + C= bdverPo) |. 


1 + voVbo 
We note that the function 


(1 = bo) vovbo 


f(vo) = bo + 1+ vb 
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Fig. 6—b(v) for lowest-order even mode in a refractive slab guide and loss-weighing 
function f(vo) for lowest-order even mode. 


associated with Ak is only dependent on v» and we have thus again 
achieved a representation valid for all waveguide parameters. Figure 
6 shows that f(vo) approaches unity much faster than bo(vo); the loss 
is approaching the loss or gain in the center region indicating improved 
confinement due to the gain-loss difference. 

We will now consider the refractive index depression in the center 
region. From our discussion of cutoff frequencies, we know that modes 
in this case do have a cutoff frequency if they exist at all. It can be 


determined from the condition that w = vvb is purely imaginary. 

In Fig. 7 real and imaginary parts of b are plotted as functions of 
s/X with the parameters listed in the figure caption. Since 0, is larger 
than one, eq. (10) shows that the effective refractive index is smaller 
than that of either medium (An is negative). The phase velocity is 
therefore larger than that of a plane wave in either medium, as in 
metallic waveguides. Altogether, this is quite a different behavior from 
the ordinary dielectric waveguide. One would expect the index depres- 
sion to counteract the confinement and, if the depression is strong 
enough, to destroy it completely. As discussed before, this is only the 
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Fig. 7—b(s/d) for lowest-order mode for a refractive index depression An = 107, 
n = 3.56, ki = 2.1074, ke = 1074. 


case below the cutoff frequency. To ensure that the solution of the 
characteristic equation is real and not a freak of the computer program, 
we have derived an approximate solution for Ak/An < 1 in Appendix 
C. The normalized propagation constant is given by 


7? .2 Ak 
where 
nN n 


IV. GAIN-INDUCED MODES IN STRIPE-GEOMETRY GaAs LASERS 


The function of the stripe geometry has been viewed as selecting 
a filament and preventing others from forming.* We will now show that, 
in contrast to a laser with a wide area contact, the stripe geometry 
does provide a confining mechanism for the optical power and is not 
merely “‘selecting’’ the filament. 

In stripe-geometry lasers, the flow of carriers is confined in a stripe 
region parallel to the junction (laterally). This is done by proton bom- 
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barding the adjacent regions.? The proton bombardment and subse- 
quent annealing alter the free-carrier density but not the optical prop- 
erties of the material. There are no deliberately built-in refractive 
index changes between the active and the passive regions. The experi- 
ments show that with stripe widths of ~12 um the lasers operate gen- 
erally in a single lateral mode; the origin of this confinement has not 
yet been explained. Figure 8a gives a linear scan of a typical nearfield 
distribution. However, at high current levels or wider stripe widths, 
numerous deviations from such a single-mode field distribution occur. 
Figure 8b gives an example of the nearfield in such a case. The field still 
fills essentially the whole stripe width but is highly nonuniform. Fre- 
quently the intensity maximum changes its position with changing 
current. It is clear that the simple case we have been analyzing so far 
cannot explain these effects. However, not enough knowledge is avail- 
able at this time about the numerous parameters involved. We there- 
fore try only to isolate the common properties of the majority of stripe- 
geometry lasers and explain them in terms of gain-induced modes. We 
had three major facts to consider: 


(t) For stripe widths in the order of 12 um and less, there is gen- 
erally a single mode for current levels close to-threshold. 
(ii) At stripe widths above ~18 ym, the field distribution is very 
often nonuniform even at threshold. 
(777) The threshold current increases steeply with decreasing stripe 
width. 


Since we know that there must be a gain-loss difference between the 
stripe and the surrounding regions, we will apply the previous results 
on gain-induced modes and investigate if the experimental results can 
be explained by this effect or if additional mechanisms have to be in- 
voked. In doing so we must keep in mind that we can expect the 
agreement between experiment and this theory to be only qualitative 
since a number of effects are neglected. We mention the few most 
obvious: 


(z) The loss in the layers above and below the active region 
contributes to the overall loss. This contribution is dependent 
on stripe width. 

(t7) The gain depends on the field intensity and is therefore not 
constant. 
(7it) The mirror loss may depend on the stripe width. 


The mirror reflectivity R of the lowest-order mode was determined to 
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Fig. 8—Linear scan of nearfield distribution of stripe geometry lasers with stripe 


width of ~12 um. One division ~3.7 um. Current in both cases is ~20 percent above 
threshold. 


be ~0.3 and hence the mirror loss, defined by ay = 1/L In 1/R, to be 
=10/cm for L = 400 um sample length.® The attenuation of GaAs is 
~10/cem. We ask now how much gain would one need to reach thresh- 
old with an active region width of ~12 um. In the following, we will 
call gain g the excess gain over the intrinsic attenuation in the active 
region. To reach threshold, the effective gain of the lowest-order mode 
(gb, — a(1 — b,) from eq. (9)) must equal the mirror losses: 


goby — a(1 — b-) = am, (14) 
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Fig. 9—Threshold gain as function of stripe width. Mirror ee au = 10/cm, 
n = 3.56, a2 = 10/cm, and refractive index depressions An = 107%, 


where @ is the loss in the side regions. b,-as pointed out before-is ex- 
clusively a function of |v| (Fig. 2). g\/4z is plotted as a function of 
stripe width in Fig. 9. We can see that the gain value corresponding to 
~12 um stripe width is quite realistic and we conclude that the gain- 
induced guiding must play at least some role in the stripe-geometry 
confinement. As a further piece of experimental evidence, it has been 
shown by Dyment?® that the threshold current in a stripe-geometry 
laser increases strongly with decreasing stripe width. There is, unfor- 
tunately, some uncertainty about the relationship between current 
and gain. Generally, an exponential dependency is assumed (g ~ J, 
where q varies between 1.5 and 3). We have used qg = 2 to insert 
Dyment’s measured results into Fig. 9. We have furthermore intro- 
duced g(s/\) curves for refractive index depressions of 1074 and 1073 
which correspond to carrier densities of 10!8 and 10!°/em, respectively. 
The agreement is satisfactory considering the accuracy of the measure- 
ments. It therefore seems reasonable to conclude that the gain-induced 
modes provide the dominant confining mechanism in stripe-geometry 
lasers. In view of this conclusion we will now derive a few properties of 
the gain-induced modes which should be useful for predicting some 
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properties of the stripe-geometry lasers. Wherever the ouput of the 
laser is to be coupled into an optical system, it is desirable to reduce the 
stripe width without increase in threshold current. Equation (14) was 
therefore evaluated for various loss values in the side region. Figure 10 
shows that—at least for zero refractive index difference-a has very little 
influence on the necessary gain. Evidently the increased loss in the 
higher a case is compensated for by improved guidance. 

A refractive index depression generates a gain-width relationship of 
the form [eq. (18) ] 


k J ] 
g=ay+—— 


Ann? (s/d)? [2An 
n 


It is quite evident that it is very difficult to compensate for the very 
strong s/\ dependence by proper choice of An. It is, however, possible 
to reduce the stripe width with a refractive index increase, in which 
case the effective gain is given by [eq. (12) ] 


_ aw +a(1 — f(v)) 

f(r) 
where f(v) is plotted in Fig. 6. However, at present, the technological 
difficulties of this approach have not been solved. 


(15) 


(16) 
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Fig. 10—Threshold gain as pneu of stripe widths for different attenuation a in 
the side regions. ay, = 12.5/em 
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V. CONCLUSION 


Modes caused by gain attenuation differences in a slab structure 
have been investigated. They can exist without refractive index differ- 
ences, in which case the lowest-order even modes do not have a cutoff 
frequency. The cutoff frequencies for the next-higher modes are given. 
The influence of additional refractive index increase or decrease in the 
center region is studied. One particularly interesting result is that, in 
the presence of gain, a refractive index depression does not remove the 
mode, but it causes it to have a cutoff frequency. 

An application of these results to the problem of lateral confinement 
in stripe-geometry lasers shows that gain-induced modes are mainly 
responsible for the optical confinement. Thus the stripe geometry does 
not just select a filament, but provides a confining mechanism for the 
optical power. (A similar conclusion was reached independently by F. 
R. Nash.!!) The sharp increase of the threshold current with reduced 
stripe width is a direct consequence of this gain-induced confinement. 
It therefore appears impossible to reduce the stripe width significantly 
below ~10 um without a substantial sacrifice in the threshold current. 
A deliberately built-in refractive index increase in the center region 
should alleviate this situation. 


APPENDIX A 


Field Components in Slab Guide 


TE TM 
|x| Sa 
aoe Oe 
Ey = Awpo cos ua e JBaz H, = Aweoes cos ue IBaz 
v —jBz2 v —jBzz 
H, = —AB, cosu=e z Ez = AB, cos ue IB: 
H, = —jA —sin u — e7 ee EL, = jA —sinu— e~## 
a a a a 
|x| 2a 


Ky, Awnuo cos we lsBzetw ([z/a|—-1)] H, = Awegjes COS ye 198 lezt+w (| 2z/a|—-1)] 

H, = —AB, cos we7'8e2+wlz/al-D) BB, = AB, 2 cos yen libee tv (I2/al—-D] 
€2 

., We 


-4 W : 
H, = FjA Ge ee H,=+7A Ae 
2 


X cos ye liBzz+u (I z/a|—1)] 
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The notations are explained in the list of notations at the end of the 
paper. The separation parameters are related to each other by 
(B.a)? + uw? = e(koa)?, (Bea)? — w® = €2(koa)?, 


and wu? + w? = (e = €2)(Koa)?. 


APPENDIX B 


Approximate Solution of Characteristic Equation for Refractive Index 
Increase in Outer Region 


We assume that Ak/An < 1, i.e., the refractive index step is govern- 
ing the guiding properties. The normalized frequency v can then be 


expressed by 
s An . Ak 


We treat the characteristic equation for TE modes first [eqs. (3) ]. The 
regular solution for Ak = 0 will yield real values for wu and w. For 
small Ak/An, both will be complex with a small imaginary part. We 
list the real and imaginary part of the characteristic equation separately, 
neglecting second-order terms: 

Uy SIN Ur = Wy COS Ur (18a) 


uiL(1 + w,) sin uy + Ur, COS Ur] = Wi COS U;. (18b) 


The real part (18a) is identical to the unperturbed equation and we as- 
sume, therefore, u, and w, to take the values of u and w, respectively, 


of the Ak = 0 case. Combining (18a) and (18b) yields the following 
relation between wu; and w;: 


uiluZ + w,-(1 + w,)] — wu, = 0. (19) 


Now we have to relate u:, w; to b;, the quantity we really want to 
determine. 





w=vvb u=ovvl—ob. (20) 
With a small imaginary part of b we get 
. Vi b; 
w = 0,Vb, ji+5(F+55)| (21a) 
= = . Vi = bk ; 
u=o0,V1 — 6, E + jvr ( x — b,) )| (21b) 


These equations show that, if the real parts of w and u are unperturbed 
to the first order, the real part of b will be unperturbed also. If we intro- 
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duce eqs. (21) into the characteristic eq. (19), 6; can be determined 


- Ak (1 cam b,)v,Vb, 
~ An I+ Urvb,. 


b; 


(22) 





The characteristic equation for the TM mode has an additional factor 
€1/€2. It is easy to show that the real part of b is altered by Ab from the 
TE value bo 


(23) 





The imaginary part is to a first approximation the same as for the TE 
mode [eq. (22) ]. 
APPENDIX C 


Approximate Solution of Characteristic Equation for Refractive Index 
Depression in Center Region 


Again we assume Ak/|An| < 1. An will be negative. The normalized 
frequency v is now expressed by 


2.68 [2| An| ganz) = ae <8 =) 
y= jynn mI (14 jsra = joi (1 i San] (24) 
Let us assume that wu is very close to 7/2, 

u=7r/2+ 6+ jy, (25) 


where 6 and y are small quantities. Since 


emo -4(3)], a» 


where we note that u/v is small compared to one. The characteristic 
equation to a first-order approximation takes the form 





we use 


($+ 8+ iv) + ino + JH = 0. (27) 


The solution is 





Vi 
Tsing: = 


WT 
214+ 03 2i+v (28) 


abd 
. 
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It is now easy to calculate b, which is related to u by 


The result is 


p-1-(4)" (29) 


2 
pai tB{i+szatol (30) 





keeping in mind that v,/v; « 1 and v? > 1. 


NOTATIONS 
a = Sth 
nN 
a 
b 
8 = 2a 
An = Ni — Ne 
Ak a ky = ko 
€0 
€1, €2 
g 


ky = w Hoeo 


ki,2 


nN 
Ho 
1,2 
27c 
y= —— 
A 
U 
v= akovVex pooest 
WwW 
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Heat Transfer in Electronic Systems 
With Emphasis on Asymmetric 
Heating 


By W. AUNG 
(Manuscript received January 26, 1973) 


The trend in electronic circuit design is toward increasing power dis- 
stpation density. The performance and reliability of increasing number of 
electronic systems are now seriously threatened by thermal effects, so that 
it is necessary to reappraise the relevant thermal design procedures. This 
paper examines natural convection cooling and concerns the prediction 
of maximum temperatures of electronic cabinets containing arrays of 
vertically oriented circuit cards with unequal power dissipation levels. By 
means of a vertical channel model, the effects of channel spacing, channel 
height, and power dissipation level are assessed with emphasis on asym- 
metric powering of the channel walls. Methods are indicated for rapid 
evaluation of maximum temperatures and optimum channel spacing with 
asymmetric heating. The present results show that asymmetry reduces the 
thermal performance of the channel. Consequently, the power dissipation 
on the channel walls should be made nearly equal. 


I. INTRODUCTION 


For the past several years, the trend in electronic equipment design 
has been toward ever-increasing circuit speeds. Now it is common for 
response times of telecommunication equipment to be specified in 
terms of nanoseconds while, in high-performance data processing sys- 
tems, the required response times are given in the picosecond range. 

The increase in circuit speed is facilitated by integration of circuit 
functions, circuit miniaturization, and higher-density packaging. Al- 
though the miniaturization of circuits results in decreased power dis- 
sipation per circuit, the power generation per unit volume, which is the 
important parameter in determining the circuit temperature, is actually 
increased due to the much higher packaging densities. The thermal 
problem is also compounded by the lower operating temperature re- 
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quirements of integrated circuits. This gives rise to a challenging under- 
taking in thermal design. And yet a poor thermal design could pos- 
sibly lead to complete failure or unreliable performance of equipment. 
Thus, it is imperative that a sound thermal design be initiated, and this 
at the earliest possible stage of system planning. 

The thermal design of a modern electronic system is based on a ra- 
tional selection of a cooling option followed by a thoughtful design con- 
sideration. A design should not only be practicable, but also economi- 
cal, serviceable, reliable, and compatible with other system compo- 
nents. It is the purpose of this paper to present design data on natural 
convection cooling of modern electronic systems. The emphasis is on 
the prediction of maximum temperatures in equipment. Although na- 
tural convection is the oldest method used in electronics cooling, the 
method is still employed extensively in new generations of equipment 
both in the communications and the data processing fields. In many 
cases, this method is used in conjunction with one or more of the newer 
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Fig. 1—Schematic diagram of electronic equipment. 
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Fig. 2—Two-dimensional vertical channel model. 


methods such as forced flow and the application of heat pipes and 
liquid cooling. 

The present paper concerns equipment wherein circuit boards con- 
taining heat-dissipative components are mounted vertically in channel- 
like fashion as shown in Fig. 1. Since the circuit boards are not iden- 
tically powered, the average heat fluxes from the columns are different, 
giving rise to asymmetrical air flow. In each channel, the heat transfer 
may be modeled by that in a vertical, two-dimensional, smooth-walled 
channel as shown in Fig. 2. The channel walls are treated as uniformly 
heated. However, the two wall heat fluxes need not be identical. 

A number of investigations have been reported in the literature 
concerning free convection in vertical channels. Ostrach! solved the 
combined free (natural) and forced convection problem in a vertical 
channel in fully developed flow with symmetrical uniform heat flux and 
internal heat generation. By fully developed flow is meant a situation 
in which the fluid velocity is invariant in the direction of flow. Engel 
and Mueller? investigated the effect of nonisothermal channel walls by 
assuming constant heat fluxes. Their results, however, are limited to 
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symmetrical wall heating. Lauber and Welch? considered the combined 
free and forced convection between vertical flat plates which are heated 
at uniform but different heat fluxes. Their work is confined to fully 
developed flow. 

Two standard references widely applied to heat transfer in electronic 
equipment are the experimental work of Elenbaas* and the numerical 
solution of Bodoia and Osterle.* Inasmuch as the above references deal 
with a channel whose walls are at a constant temperature, two factors 
are therefore ignored in their results: the effect of nonisothermal sur- 
faces and asymmetric heating. These effects are considered in the 
present paper. 

It is to be noted that, as in the references cited above, radiative trans- 
fer of heat is ignored in this paper. In some electronic systems, radiation 
may be a significant factor in heat removal. Inasmuch as temperatures 
of heat sources are of interest, the neglect of radiation usually results 
in conservative (too high) estimates of the temperatures. 


II. ANALYSIS 


Referring to Fig. 2 which shows a two-dimensional, straight vertical 
channel, let the channel height be ¢ and width be b. The channel walls 


FULLY DEVELOPED FLOW EQ. (5) 


SINGLE PLATE 
SOLUTION 
EQ. (7) 


(Tax — Tp) k/ab 





_Fig. 3—Relation between dimensionless maximum wall temperature rise and the 
dimensionless parameter L (from Aung, et al.®). 
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are heated causing fluid to rise between them by free convection. The 
thermal conditions of the walls are characterized as uniform heat flux 
(UHF), but the individual values for the two walls need not be the 
same. The fluid that enters the channel at the ambient temperature 7'p 
is assumed to have a flat velocity profile wo. The equations expressing 
conservation of mass, momentum, and energy for free convection in 
the constant-property (except for density in the buoyancy term) 
laminar flow may be written respectively as: 


Ou Ov 
ae" oy 7 0, (1) 
CE Lag i = 
Uu ax + v ay ae oy? p dz r g8(T To), (2) 
oT oT oT 
pes (uge tose) =k, (3) 


where p’ is the pressure excess above the hydrostatic pressure. 
The boundary conditions are: 


Forx =Qand0<y< 0b: 
U = Ud, T=T; 
For y = Oandz 2 0: 


oT 
Uu = 0, v= 0, oi, = — 2; (4) 
For y = bandz 2 0: 
oT 
u = 0, v = 0, ap ae 


At x = 0 and « = ¢: p = po, where po is the hydrostatic pressure. 

The system of eqs. (1) to (4) may be cast in nondimensional form and 
then solved by numerical integration using a digital computer. Details 
are contained in Refs. 6 and 7. The latter references show that the nu- 
merical results agree well with experimental data. It may be pointed 
out that, for a channel whose height is large compared to the spacing so 
that the so-called fully developed flow condition exists, the governing 
eqs. (1) to (8) may be simplified and the solution is then obtainable in 
closed form. This is indicated in Ref. 8. In the present paper, these re- 
sults will be applied to indicate the effect of operating parameters on 
maximum wall temperatures. The cooling fluid considered is exclusively 
air. 
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III. RESULTS 


When the channel walls are individually heated at uniform heat 
fluxes, the quantity that is of design interest is generally the maximum 
temperature rise on the hotter wall. This quantity may be obtained 
from the general results shown in Fig. 3. It is clear from the latter that 
the maximum temperature increase depends on the parameter L 
which in turn depends on the quantity ¢/gb®. When the latter quantity 
is large, fully developed flow is approached® and the desired informa- 
tion on the maximum temperature increase is given by a rather simple 
expression which follows. 


3.1 Results at Large £/qb*® 


At large values of the quantity ¢/gb‘, the flow in the channel exhibits 
interesting characteristics whereby the velocity distribution across the 
channel remains unaltered with axial distance. All axial temperature 
variations are also given by linear relations. Typical velocity and tem- 
perature distributions in this situation are given in Fig. 4. This is the 
so-called fully developed flow situation. Clearly, it prevails most com- 
monly in a channel whose height ¢ is large compared to its width. In 
a channel with developing flow, typical velocity and temperature dis- 


1.0 


(T—T9)/( Tmax, 1— To) 





0.5 i 
y/b y/b 
TEMPERATURE DISTRIBUTION VELOCITY DISTRIBUTION 


Fig. 4—Typical temperature and velocity distributions in a channel having nearly 
fully developed flow. 
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Fig. 5—Typical temperature and velocity distributions in a channel having de- 
veloping flow. 


tributions are as indicated in Fig. 5. In both Fig. 4 and Fig. 5, the quan- 
tity M is a dimensionless mass flow parameter which is related to L 
(see Refs. 6 and 8). 

Figure 3 shows that, as fully developed flow is approached, all curves 
coalesce into one. Based on the derivations given in Ref. 8, it can be 
shown that, in fully developed uniform heat flux channels with asym- 
metric heating, the maximum temperatures on the hotter and cooler 
walls become practically identical and may be evaluated using the 
equation: 

Binge a = Omax,o = eo 
L=5, allraf 


For air at 120°F, the Prandtl number (Pr) is about 0.7. Evaluating the 
thermal physical properties for air in eq. (5) at a temperature of 120°F, 
we may rewrite eq. (5) as: 


free <e To = free a To = 0.1610(g)°-*(£)°-5(b) “1 


for ce = 1.19 X 10°, all ry 
qb? 


(5) 


(6) 


In eq. (6), £is expressed in ft, gin W/ft?, and bin ft. Note that, if Tmax,1 
alone is to be evaluated, eq. (6) is applicable at ¢/gb> = 4.76 X 107. 
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The reason is apparent from Fig. 3 where at L = 0.2 the deviations 
among curves Of 6max,1 at all values of ry may be neglected. 


3.2 Results at Small £/qb* 


The counter-case to the situation just discussed is the one in which 
the channel height ¢ is relatively short compared to its width b. Again, 
in this situation the maximum temperature rise on the channel walls 
may be calculated by a simple relation. Note that, to use this relation, 
which will be given below, it is not necessary that ¢ be smaller than b. 
It is only necessary to fulfill the validity limit attached to the equation. 

For relatively short channels, the results in Ref. 6 indicate that the 
maximum temperature in each wall is independent of the other wall, 
and may be calculated using the result for a single vertical flat plate.® 
The latter result is valid for the asymmetrically heated channel when 
L S 10-3. Thus, we may write: 


Omax = 2.05(L)"5 at LS 107%. (7) 


Again, inserting thermal physical properties of air at 120°F, the above 
may be written: 
Tmax ~ To = 8.6661/5g4/5 

at ne < 235X105 allraf (8) 
In eq. (8), (Tmax — 7J'o) is in degrees Fahrenheit, ¢ in feet, and q in 
watts per square foot of surface area. Equation (8), which gives the 
maximum temperature on either the hotter or cooler wall when q is 
appropriately replaced by q: or gz, shows that the maximum tempera- 
ture on the hotter wall, subject to the attached condition, is indepen- 
dent of the channel spacing which is to be expected. It may also be 


noted for reference that 
£ 1+rpr £ 
(a oe " 


3.3 Results at Intermediate ¢/qb* 


If the proposed design is such that neither eq. (6) nor eq. (8) may be 
applied, then the maximum temperature increase in the channel can 
be evaluated using Fig. 3. Parametric curves can be generated from 
Fig. 3 to display the effect of various operating variables on the maxi- 
mum temperature rise on the hotter wall. Since the value of L depends 
on the channel spacing, the average heat flux, and the channel height, 
any one of these may be varied while others remain constant at typical 


HEAT TRANSFER IN ELECTRONIC SYSTEMS 


915 


240 


200 


160 


Tmax, 1 — To [°F] 


80 





40 
0.02 0.04 0.06 


b OR bopy [ FT] 


Fig. 6—Effect of channel spacing on the maximum wall temperature rise. 
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Fig. 7—Effect of channel height on the maximum wall temperature rise. 
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Fig. 8—Effect of average heat flux on the maximum wall temperature rise. 


operating values. The following conditions are chosen as an example: 


b = 0.0365 foot (0.4375 inch) 
£ = 6 feet (10) 
g = 5.75 watts per square foot. 


The resulting curves are shown in Figs. 6 through 8 which can be used 
by the designer. To obtain Fig. 6, ¢ and g are obtained from eqs. (10) 
and, corresponding to each pair of assigned values of b and ry, L is 
calculated. A value of 6max,11S then obtained from Fig. 3. The maximum 
temperature rise can then be obtained. By varying ry and ), the solid 
curves in Fig. 6 result. The range of validity of eq. (8) is also indicated. 
Figures 7 and 8 are obtained in the same manner. The dashed lines in 
Figs. 6 and 8 pertain to optimum spacing which will be discussed later. 
From Figs. 6 through 8, it can be seen that at any fixed maximum tem- 
perature rise the effect of asymmetry is to increase the necessary chan- 
nel spacing, decrease the channel height, or decrease the heat flux when 
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any one of these quantities is taken as the sole variable in the 
equipment. 

The effect of asymmetric wall heating may be more conveniently 
examined by replotting Fig. 3 in the manner of Fig. 9. In the latter, 
the maximum temperature increase on each wall of an asymmetric 
channel is normalized by the maximum temperature increase on a 
similar channel wherein the same average heat flux is distributed evenly 
on the two walls, so that ry = 1. Clearly, as the degree of asymmetry 
is increased so that ry is decreased without affecting the average heat 
flux, the result is that the maximum temperature on the hotter wall is 
raised while that on the cooler wall is depressed. Therefore, in equip- 
ment design, it is desirable to obtain equal heating on the two channel 
walls. This is also desirable from the standpoint of optimum spacing, 
as will be seen. 

With the help of Fig. 9, the maximum temperature increases in an 
asymmetric channel may be evaluated once the symmetric heating 
value is known. The latter may be obtained rapidly with the aid of the 
nomogram shown in Fig. 10 once ¢, b, and g are known, ry being 1 in 
Fig. 10. The use of the latter has been described in Ref. 7 but is repeated 
here for ease of reference. To use Fig. 10, first locate Points 1, 2, and 4. 


2.0 
USE EO. (8) 
FOR Tax. 1 
AND Tmax. 2 
= 8 USE EQ, (6) 
" 
: FOR Ty ax. 1 
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Fig. 9—Maximum wall temperature rise as a function of the quantity ¢/gb°. 
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1 (from Aung, Kessler, and Beitin’). 


Fig. 10—Nomogram for evaluation of maximum wall temperature increase, rz 
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Draw a line connecting Points 1 and 2. Extend this line to intersect 
with the Pivot Axis 8 to give Point 3, and with Pivot Axis a to give 
Point 6. Draw a line joining Points 3 and 4. The intersection of this 
line with Pivot Axis y gives Point 5. Draw a line connecting Points 5 
and 6. Read the a value at Point 6 which for the case illustrated is 2. 
The intersection of line 5-6 with the appropriate a-branch (@ = 2 in 
this case) in the center of the nomogram (Point 7) gives the maximum 
card temperature rise over room ambient, which in this example is 
38°C or 68°F. It should be noted that each a-branch is made up of 
three segments: a left-side vertical line which corresponds to flow 
around a single flat plate, a right-side vertical line which corresponds to 
fully developed flow, and an inclined straight line Joining the two verti- 
cal ones. Moreover, the asymptotes of the vertical lines coincide for all 
values of a. Hence, if the line joining Points 5 and 6 does not intersect the 
appropriate a-branch on its inclined segment, either the left or the 
right vertical segment should be chosen depending on whether the 5-6 
line passes above the inclined segment or below it. 


3.4 Optimum Spacing Including the Effect of Asymmetry 


Consideration may now be given to the overall performance of the 
channel as a heat-removing device. In electronic equipment the maxi- 
mum device junction temperature is usually specified. Since the junc- 
tion temperature is related to the wall temperature, the maximum wall 
temperature is therefore implied. In modern equipment, it is desirable 
to increase the packing density and hence the total power dissipation of 
the entire equipment without increasing the maximum wall tempera- 
ture beyond the allowable limit. As shown in the appendix, the task 
here is to select 6 so as to maximize the heat transfer. The latter value 
will be designated qopt and the corresponding spacing is called Dopt. 
Design curves for finding bop, and Qopt Will be given below. The emphasis 
here is on asymmetrically heated channels. A more detailed discussion 
of optimum spacing in a symmetric channel may be found in Ref. 7. 
Note that maximum power dissipation is realized only when the equip- 
ment is strictly operating at bopt and Qopt. If a different (smaller) spac- 
ing is used, then a different (smaller) heat flux must be employed to 
yield the same maximum temperature increase. For this purpose, de- 
sign curves such as those in Figs. 6 through 8 can be consulted. 

Following Bodoia,!® it can be shown that the power dissipation is 
maximized if the channel parameters are so selected that the channel is 
operated at the point where the slope of a log-log plot of Nu versus Ra 
is one-half, where Nu is the Nusselt number and Ra the Rayleigh 
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number defined as: 


_ gb ey el 
= CT next — To)k 7 Griae 47 
—_ 4 a = 
Ra = Pr x gB(T'max.1 — To)b! = 0.7 x 6 max,1 x L. 


ty? 


From Fig. 3, a relation between Nu and Ra can be obtained. This is 
shown in Fig. 11. Calling the values of Nu and Ra for maximum power 
dissipation Nuop: and Racpt, respectively, Table I may be constructed 
from Fig. 11. In Table I, E is an efficiency of heat transfer defined as 


_ Pp 
(p) rH=1 
where B = Nuopte/(Raopt)!/? ¢- (11) 
- Ropt 


bopt 


In the above, hopt and bop: are optimum values of hmax,1 and b which give 


maximum power dissipation. 
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Fig. 11—Relation between Nusselt and Rayleigh numbers. 
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TABLE I—OptTimumM RAYLEIGH AND NussELT NUMBERS 


TH Raopt Nuopt E 

0 42 0.43 65% 
0.1 51 0.51 70% 
0.5 70 0.73 86% 
1.0 135 1.18 100% 


From Table I, it is seen that the efficiency E varies in the same 
manner as ry. Asymmetry therefore decreases the efficiency of heat 
transfer. From the thermal standpoint all equipment should therefore 
be designed to yield symmetrical heating as closely as possible. 

Using values of Ra»; from Table I, it is possible to obtain optimum 
spacings for different maximum temperatures once the channel height 
is specified. In like manner, the necessary heat flux at optimum spacing 
corresponding to different prescribed maximum temperatures can also 
be obtained from Nuopt. Results have been obtained for a 6-foot chan- 
nel with ry = 0.0 and ry = 0.5 and compared with the case ry = 1 
in Figs. 6 and 8 (indicated by the dashed line). It may be ascertained 
from these figures that the effect of asymmetric heating is to decrease 
bopt and Qopt. The net effect of asymmetry is to decrease the total power 
dissipation in the cabinet (see appendix). 
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Fig. 12—Optimum spacing and optimum heat flux as a function of the heat flux 
ratio rz. 
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Fig. 13.—Nomogram for evaluation of optimum spacing, ry = 1 (from Aung, 
Kessler, and Beitin’). 


Using the values of Raop: and Nuopt given in Table I, it is possible to 
obtain a general relation between optimum spacing bop: or optimum 
heat flux gop: and ry, since 


Dont = (Raopt) ue 


(Bopt) ry—1 ~ (Raopt)74=1 
and 


Gopt = N Uopt : 
(Gopt) rH=1 (Nuopt) rH=1 


The results are indicated in Fig. 12. With the aid of this figure and the 
nomogram given in Fig. 13 which yields (bopt)+,=1, the optimum spac- 
ing at specified ry, ¢, and maximum hotter wall temperature rise 
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(Tmax,1 — To) can be rapidly evaluated. Note that the maximum tem- 
perature rise to be entered in Fig. 13 is also the specified value of 
CP evar ja See, T»). 


IV. CONCLUDING REMARKS 


In the present paper, consideration has been given to the free or 
natural convection cooling of electronic cabinets containing arrays of 
vertical circuit cards with asymmetric heating. It was indicated that 
fully developed flow prevailed in a channel whose height is large com- 
pared to its width. In this case, the maximum temperature in the chan- 
nel can be calculated using the simple relation presented here. In con- 
trast, when the channel spacing is relatively large, the maximum tem- 
perature on each wall is independent of the conditions on the other 
wall but can again be calculated by a simple equation. 

In situations where neither of the above two approaches can be ap- 
plied, the maximum temperature in the channel can be evaluated 
rapidly using the graphical procedure outlined in this paper. The opti- 
mum channel spacing for maximizing the power dissipation is discussed 
and it is emphasized that to reduce adverse thermal effects channel 
walls should be powered as equally as possible. 


APPENDIX 


The total power dissipation in the cabinet per unit depth can be 
written as 
2tNG 
2ENA(T max,1 — To), 


Power 


where WN is the number of channels (see Fig. 1). If the thickness of the 
channel wall is neglected, we have 


W 
N= a 
Hence, 
Power = 2¢W(Tmax,1 — T)* 


h 
[oy 


b 


for fixed (Tmax,1 — To), W, and ¢. Clearly, to obtain the largest pos- 
sible power one needs to obtain the maximum hf (that is g) and the 
minimum 0, the combination of which gives the specified (T'max,1 — To). 
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NOTATIONS 

b clear channel or card spacing in feet (ft) 

Cy specific heat of air in W-s/Ibi-F 

g acceleration due to gravity in ft/s? 

h G/(T max,1 — To), average heat transfer coefficient, in My F 
k thermal conductivity of air in W/ft-F 

Ly, (4k) /(g8qib*), dimensionless 

L  (tv"k)/(g@qb*), dimensionless 

£ channel height in ft 

Pr Prandtl number defined by uc,/k, dimensionless 
p pressure in lb;/ft? 

oo hydrostatic pressure in lb;/ft? 

p’ (p — po) in lb;/ft? 

q heat flux on channel wall per unit surface area, in W/ft? 
G@ = (1 + G2) /2, W/ft? 

ru  Q2/Q1, dimensionless 

T temperature in degrees Fahrenheit (F) 

T,) ambient temperature in degrees Fahrenheit 

u,v axial and transverse velocity in ft/s 

W _ width of cabinet 

x,y axial and transverse coordinates in ft, see Fig. 2 
B thermal expansion coefficient of air in 1/F 

Lt dynamic viscosity in lb,,/ft-s 

y kinematic viscosity of air in ft?/s 

p density of air in lb,,/ft® 

6 dimensionless temperature rise defined in Fig. 3 
Subscripts 

1 refers to hotter wall 

2 refers to cooler wall 

max maximum value 


max, 1 maximum value on hotter wall 
max, 2 maximum value on cooler wall 
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On the Selection of a Two-Dimensional 
Signal Constellation in the Presence 
of Phase Jitter and Gaussian Noise 


By G. J. FOSCHINI, R. D. GITLIN, and S. B. WEINSTEIN 
(Manuscript received February 1, 1973) 


A long-standing communications problem is the efficient coding of 
a block of binary data into a patr of in-phase and quadrature components. 
This modulation technique may be regarded as the placing of a discrete 
number of signal points in two dimensions. Quadrature amplitude modu- 
lation (QAM) and combined amplitude and phase modulation (AM-PM) 
are two familiar examples of this signaling format. Subject to a peak or 
average power constraint, the selection of the signal coordinates is done so 
as to minimize the probability of error. In the design of high-speed data 
communication systems this problem becomes one of great practical sig- 
nificance since the dense packing of signal points reduces the margin 
against Gaussian noise. Phase jitter, which tends to perturb the angular 
location of the transmitted signal point, further degrades the error rate. 
Previous investigations have considered the signal evaluation and design 
problem in the presence of Gaussian noise alone and within the framework 
of a particular structure, such as conventional amplitude and phase modu- 
lation. We present techniques to evaluate and optimize the choice of a signal 
constellation in the presence of both Gaussian noise and carrier phase 
jitter. The performance of a number of currently used or proposed signal 
constellations are compared. 

The evaluation and the optimization are based upon a perturbation 
analysis of the probability density of the received signal given the trans- 
mitted signal. Laplace’s asymptotic formula is used for the evaluation. 
Discretizing the signal space reduces the optimal signal design problem 
under a peak power constraint to a tractable mathematical programming 
problem. 

Our results indicate that in Gaussian noise alone an improvement in 
signal-to-noise ratio of as much as 2 dB may be realized by using quadra- 
ture amplitude modulation instead of conventional amplitude and phase 
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modulation. New modulation formats are proposed which perform very 
well in Gaussian noise and additionally are quite insensitive to moderate 
amounts of phase jitter. 


I. INTRODUCTION 


A very attractive modulation format for coherent high-speed data 
transmission is the family of suppressed-carrier, two-dimensional signal 
constellations of which quadrature amplitude modulation (QAM) and 
combined amplitude and phase modulation (AM-PM) are two ex- 
amples. In this paper we will consider using this more general signal 
format, which is equivalent to the arbitrary placement of a discrete 
number of signal points in the plane, subject only to a peak or average 
power constraint. The object will be to mitigate the major statistical 
transmission impairments encountered on the voice-grade telephone 
channel, such as carrier frequency offset, carrier phase jitter, and 
additive noise. Our attention is focused on constellations of 16 points, 
since this seems to be the largest constellation which is practical 
for the typical telephone channel. However, the techniques we develop 
are applicable to constellations of arbitrary size. 

The placing of signal points in the plane is a long-standing problem 
that has received considerable attention in the past. Previous investi- 
gations!’ have considered the signal evaluation and design problem in 
the presence of Gaussian noise alone and within the framework of 
a particular structure such as combined amplitude and phase modula- 
tion. When Gaussian noise is the only transmission impairment, it is 
well known that at high signal-to-noise ratios (>25 dB) the signal 
points should be placed as far apart from each other as possible (the 
circle-packing problem). In the application to high-speed digital com- 
munication systems, the two-dimensional signal design problem be- 
comes one of great practical significance because the dense packing of 
a large number of signal points markedly reduces the margin against 
random noise and phase jitter. The signal design problem in the pres- 
ence of both phase jitter and Gaussian noise has not been solved before 
and is the subject of our discussion. 

Coherent receiver structures have recently been proposed which em- 
ploy an adaptive equalizer’ to compensate for any linear distortion, 
low-frequency phase jitter, or small amounts of frequency offset intro- 
duced by the channel. A phase-locked loop> may be used to suppress 
high-frequency phase jitter and frequency offset. Of course, the output 
of a phase-locked loop will still deviate somewhat from the optimum 
demodulating phase angle. One purpose of this study is to assess the 
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effect of such phase errors on the system error rate and to indicate how 
this knowledge can be incorporated into the system design. 

By system design we have in mind the selection of both a particular 
two-dimensional signaling format and the decision device placed at the 
demodulator output. In order to pursue these objectives, we discuss: 


(t) the relative immunity of various signal constellations as a func- 
tion of the degree of noncoherency (i.e., the size of the phase 
error); 

(77) an efficient iterative procedure for determining optimum signal 
formats under a peak power constraint; 

(272) system performance for the following hierarchy of decision de- 

vices: easily implementable, optimum in Gaussian noise, and 
a maximum-likelihood detector which uses the statistics of the 
phase error; 

(tv) the accuracy required in any phase-locked loop to attain a satis- 

factory error rate; and 

(v) the resulting error rate when no attempt is made to track the 
jitter. * 

Our approach is to assume that intersymbol interference has been 
effectively eliminated by the equalizer while the phase-locked loop, if 
there is one, has only partially removed the phase jitter. Thus the 
equalizer output will be the sum of the partially coherent’ transmitted 
signal and additive Gaussian noise. We adopt a phenomenological 
model which assumes that the (slowly varying) phase error has 
a Tikhonov density.> The Tikhonov density is associated with a con- 
ventional first-order phase-locked loop whose input is the sum of a 
sinusoid (whose phase is being tracked by the loop) and Gaussian 
noise. Under our assumed operating conditions of high signal-to-noise 
ratio, this density will closely approximate the actual phase density. 
When no attempt at tracking is made,’ the jitter is modeled as being 
uniformly distributed in a reasonable peak-to-peak range. For each 
of these jitter densities, the probability density of the demodulator 
output, conditioned on the transmitted symbol, is used to estimate the 
error rate. This estimate is of the minimum distance type, where the 
“distance,”’ which reflects the presence of phase jitter, is measured in 

*In the sequel, we will use the term jitter as a catch-all when referring to phase 
jitter and/or frequency offset. 

T A partially coherent signal is one whose carrier phase is jittered by a random com- 
ponent which is not uniformly distributed in the range (—z, 7). 

* Since the passband equalizer‘ will determine the optimum static demodulating 


phase, the absence of a phase-locked loop does not imply the use of an arbitrary de- 
modulating phase. 
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a non-Euclidean manner. The error rate is given for various signal 
constellations and detector structures under peak and average power 
constraints. By discretizing the received signal space, an iterative 
procedure is developed to determine (locally) optimum signal formats 
under a peak power constraint. This technique, which assumes a maxi- 
mum-likelihood detector, makes use of an efficient search procedure 
developed by Kernighan and Lin.*® 

The system model and the problem formulation are presented in 
Section II. An asymptotic estimate and an upper bound on the error 
rate are developed in Section III. A comparison of the relative im- 
munity of some popular signal constellations and detectors to phase 
jitter is described in Section IV. Section V discusses a technique to 
obtain locally optimum signal structures under a peak power constraint. * 


II. SYSTEM MODEL AND PROBLEM FORMULATION 
2.1 Preliminaries 


We consider the two-dimensional synchronous data communication 
system shown in Fig. 1. Binary data are first grouped into blocks of 
M bits, and each block of M bits is then. mapped into one of 2” two- 
tuples (a, b). The sequences {az} and {b,} amplitude modulate, re- 
spectively, an in-phase and quadrature carrier to generate the trans- 
mitted signal 


mt) = y axp(t — kT) cos we + x byp(t — kT) sin wet, (1) 


where 1/T is the symbol rate,’ p(-) represents the transmitter pulse 
shaping, and w, is the carrier frequency. It will be assumed that the 
two-tuples are equiprobable. The received signal at the output of the 
bandpass filter is given by 


rt) = (2. axe(t — kL) — 2, bay(t — kT) cos (we +.A)t + 6(0)) 
— (u axy(t — kT) + y b.(é — kT)) 
X sin ((we + At + O()) +n, (2) 


where x(t) and y(¢) are the system (baseband) in-phase and quadrature 
impulse responses,’ A is the carrier frequency offset, @(¢) is the random 


“The present authors have recently treated the two-dimensional signal design 
problem under an average power constraint.” 
: Note that the data rate is M/T bits/second. 
t These pulses represent the cascade of the transmitter shaping filter, the channel, 
and the receiving filter. 
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Fig. 1—An in-phase and quadrature data transmission system. 
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phase jitter, and n(t) is additive Gaussian noise. The received signal, 
which is sampled at the symbol rate, is adaptively equalized and 
then coherently demodulated with the aid of a phase-locked loop. 
The demodulated output, denoted by the sequence of two-tuples 
Z, = {(zx, 2z)}, 1s then processed by the (simple) detector to give the 
output sequence {(é;, 5,)}. The system error rate is just the probabil- 
ity that (4;, 5,) differs from the transmitted two-tuple (az, b;). 


2.2 Basic Model 


For the purposes of this study, it will be convenient to assume that 
the equalizer has completely eliminated the intersymbol interference 
present in x(¢) and y(t), but that the phase-locked loop has only par- 
tially compensated for the carrier phase jitter. The in-phase and 
quadrature demodulator outputs, at the kth sampling instant, are 
then given by 


2(kT) = a, cos d; — bx sin dx + n. (kT), 


(kT) Qx sin ?% + bx cos ox + n.(kT), (3) 


Il 
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Fig. 2—Effect of Gaussian noise and phase jitter on transmitted symbol. 


where ¢; is the (slowly varying) phase error in the tracking loop, and 
n(kT) and n,(kT) are, respectively, the in-phase and quadrature 
Gaussian noise components.* Dropping the time index, eq. (3) can be 
rewritten to give the basic model 


z= Rs+n, (4) 


where the vectors are given byt 


oe )e We) BAG): 


and the matrix F is the rotational (by an angle ¢) transformation 


Ps ne —sin s): (5) 


sin @ cos d 


* Recall that n-(kT) and ns(kT) are independent Gaussian random variables with 
equal variance, No. It should be noted that 2No is the noise power contained in the 
bandwidth of the received signal. 

t We denote the values that a and b can assume by a“) and b“, respectively, and 
the values of the transmitted symbols by s® = (a%),b%), 
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Fig. 3—Quadrature amplitude modulation. 


As we show in Fig. 2, the effect of the phase jitter is to rotate the trans- 
mitted symbol, s, by an angle ¢; thus the demodulator output, z, is dis- 
persed in an angular manner due to phase jitter and in a circularly sym- 
metric way due to the Gaussian noise. The receiver will make an error 
when these perturbations move the demodulator output across the 
decision boundary associated with the transmitted symbol. For a par- 
ticular transmitted data sequence, the demodulated sequence will be 
scattered about the transmitted points in a manner which reflects the 
combined effects of phase jitter and Gaussian noise. For the transmitted 
signal constellation of Fig. 3, which is known as QAM, a typical scat- 
tered demodulated sequence is shown in Fig. 4. As one might expect, 
for those signal points further away from the origin, the angular dis- 
placement becomes more apparent. An estimate of this effect is given 
by the mean-square error between the transmitted and demodulated 
symbols. For small values of jitter, this error is obtained from (4) by 
noting that 


z— 30 = (Ono —sin ¢ )so+n—6( 3%) 4a 
3 


sin @ cos ¢@ — 1 
averaging the norm-squared of both sides gives* 


Ei|z — s|/? = No + agils|?, (6) 


*The notation ||z|| denotes the Euclidean norm of z; additionally the notation 
{z, 8) will be used to denote the inner product of z and s. 
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Fig. 4—Received signal points in a QAM system: 


where o% is the variance of o and E denotes the statistical average. 
Thus, because of phase jitter, signal points located further from the 
origin are subjected to a larger mean-square error. 
2.3 Probability Density of the Demodulator Output 

In order to evaluate the system error rate, the probability density 
function (pdf) of the phase error must be specified: The pdf of the phase 
error in a phase-locked loop that is tracking the angle of the two- 
dimensional data signal given by (2) is not yet known, but as explained 
below it can be approximated by the following (Tikhonov) density®: 


1 e@ @cos ¢ 


pO) =a Toy «dl Sm (7) 
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where Jo(-) is the modified Bessel function of the first kind and a is 
a positive number. As is shown in Fig. 5, a = 0 implies a completely 
incoherent system, while a = © corresponds to a completely coherent 
system. For large values of a (a > 100), we have the useful relation 


a3 = (in radians). (8) 


Since the above density arises from a first-order phase-locked loop 
whose input is the sum of Gaussian noise and a sinusoid (whose phase 
is being tracked by the loop), it is felt that for high signal-to-noise 
ratio, for negligible intersymbol interference, and for slowly varying 
phase jitter the actual phase-error density will closely resemble the 
Tikhonov density. This simple model will be quite useful in studying 
the effect of phase jitter on the system error rate. 

The pdf of the demodulator output, conditioned on the transmission 
of s“, is given by 


pals) 4 pz) =" p,(zl4)p(o)as, (9) 


where it is noted that the output density conditioned on both s™ and 
@ is given by 


pi(z|8) = say. exp | — ppp. [lz — Rs | (102) 





1 ; 
it a — g(a|]2 - (3) 
— 2(z, Rs). (10b) 
Substituting the Tikhonov density into (9) gives 


ps2) = gery ex? | — say. Elle — 8°91 + 202, 8)]}- 


(Tola))~* 5 [7 exp 5 [(dz, 8) + aN) 008 4 
+ (2, 8) sin 614, (11) 


(where if s& = ||s®||(cos a, sina) then s+ = ||s||(sina, —cosa@)), 
and we recognize the latter integral as 





1 (EET ENI FG FOR), 


Assume a = k/No (k a constant). Employing the well-known result 
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—7 0 T 


Fig. 5—Tikhonov phase jitter density p (6) = exp (a cos ¢)/27I,(a). 


that for large values of argument I(x) ~ e*/|Vz|, and assuming a is 
sufficiently large and N» is sufficiently small, we get 


yao ae ene oe ee a (— ayy 
PAY * BaN WN W(a, 89) + aN)? + (@, 8) 2 \~ BN 
X {]z, s|]? + 2(z,s) + 2aNo 
— NGS FANT ESA). (12) 


In the error rate computations we shall eventually make, we will have 

k so large compared to the practical range of (z, s“) that the coeffi- 

cient multiplying the exponential of p,(z) can be taken to be (27No)71. 
Thus, for our purposes, 


1 1 ? 
pila) ~ sane exp | — gay C2, 8°) |, (13) 
where 
d2(z,s) & ||z — s||? + 2(z, s) + 2aNo 
— 2 v|[z|/?||s]]? + 2aNo(z, s) + (aNo)?|. (14) 


The form of eq. (13) is very reminiscent of the density in Gaussian 
noise alone, and to further suggest such a similarity we refer to d(z, s) 
as the ‘‘distance’”’ between z and s. It is important to emphasize that 
this function is the key to assessing the combined effect of Gaussian 
noise and phase jitter on the system performance; through its use we 
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SNR = 25 dB, AND RMS PHASE JITTER =9 DEGREES 





Fig. 6—Constant distance contours: values of z for which 
2aNopj(2) = exp { — gaz @(z, 83) } = 10-4, 10-4, 10, 
d? given by eq. (14). 


are able to give a very suggestive geometric interpretation of the jitter 
phenomenon as well as accurate estimates of the error rate. In Fig. 6 
we show some contours of constant “distance’’ (i.e., points which are 
equiprobable) about a given point. Note the angular orientation and 
similarity of these ‘banana’ shaped contours to those obtained ex- 
perimentally (Fig. 4). 

Our procedure will be to use (13) and (14) to estimate the error rate 
for various constellations and detector structures; however, before we 
do this, we first wish to discuss some properties of the function d(z, s) 
which will be useful in estimating the system performance, and then to 
consider the conditional density p,;(z) in the absence of a phase-locked 
loop. 


2.4 Some Properties of the Jitter Distance d(z, s) 


(7) A requisite property that d(z, s) should possess is that for vanish- 
ingly small jitter the distance between points becomes Euclidean. This 
is easily verified by expanding (14) in terms of 1/a and observing that 


ee d?(z,s) = |lz — s|]?. (15) 
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(iz) Since we do not expect the distance between an arbitrary point 
and the origin to be affected by phase jitter, it is reassuring to note that 


d?(z, 0) = |z\/?, (16) 
and that a similar property holds for points on the same ray, i.e., 
d2(z, kz) = (1 — k)?||z|[2, k>0. (17) 


(472) A “bonus” property is that even though (14) was derived for 
large a (small jitter), the expression 


lim (2, 8) = | [lz] — Isl | (18) 


is quite reasonable in that, for completely incoherent systems, all 
points on the same circle are indistinguishable. 


(zv) An interesting consequence of (14) is that points are now closer 
together than their Euclidean distance. To demonstrate this, we use 
the Schwartz inequality to show that the non-Euclidean part of d?(z, s) 
is always negative, i.e., 


(Z, 8) + aNo S |v|lz|/?IIs|]? + 2aNo(z, s) + (aNo)?| (19) 


since squaring gives 
(z, 8)? + 2aNo(z, s) + (aNo)? S |lz\|2\/s[]? + 2eNo(z, s) + (aN)? 
which upon cancelling becomes the Schwartz inequality 

(z, 8)? S ||z||?IIs||?. 


(v) If both the signal and noise power are scaled by the same con- 
stant for a given jitter level, a, it is clear that the conditional density is 
unchanged. Thus the signal-to-noise ratio and the mean-square jitter 
a are the natural parameters for characterizing the system. 

(vi) A natural question to ask is whether or not d(z, s) is a convex 
metric* in the plane or, of more practical interest, if some sufficiently 
accurate approximation to d(Z, Ss) is a convex metric in some circle 
centered about the origin. As we shall see later, if such an approximate 
representation can be obtained, some tedious error rate computations 
may be done quite simply. While the requirements of symmetry and posi- 
tivity can easily be shown to hold in the entire plane, J. E. Mazo has re- 
cently informed us that his results in Ref. 8 imply that d(z,s) is not a 

* A convex metric is a metric which possesses the midpoint property, i.e., for any 


two points x and y there is always a third point z such that d(x,z) = d(z, y) 
= (1/2)d(x, y). 


TWO-DIMENSIONAL SIGNAL CONSTELLATION 939 


convex metric in any circle about the origin for any value of 2No. The 
question remains open as to whether or not d(z, s) admits an accurate 
convex metric approximation in a neighborhood of the origin. Appendix 
B reports the results of an investigation of this question. 

By considering the above properties, it is apparent that a signal 
constellation will be relatively immune to small amounts of phase jitter 
either if the error rate (or minimum distance) is determined by a point 
at the origin and any other signal point, or if signal points on the same 
circle are widely separated (the more circles the greater the Gaussian 
noise penalty). 


2.5 Probability Density in the Absence of a Phase-Locked Loop 


In a subsequent section we will compare the performance of various 
signal constellations, and the following question naturally arises: Can 
we, by judicious signal design, eliminate the need for a phase-locked 
loop?* Preliminary to answering this question we must obtain the den- 
sity of the demodulated signal in the absence of a phase-locked loop. 
For simplicity we model the jitter as arising from the single tone 


$(t) = A cos (wt + Y), (20) 


where 2A is the peak-to-peak jitter, w; is the jitter frequency, and y is 
a uniformly distributed random phase. For this model, the jitter den- 
sity is given by 


1 || $A 
p(s) = {Ar alt — (4) (21) 
0 || >A 


To determine the density of the demodulated samples we use (9) and 
(10) to write 


pila) = ach, exp {~ aay [le — 8°91 + 2%, 8°) I} 


; -_ 
x [7 exp {yp Ce, 8) 008 6 + (, 8) sin 6} w(O)ao 
=A 0 
which for small (<12 degrees) peak-to-peak jitter becomes 
1 1 
yes ee ts — = |lz = gtall2 
pi) ~ sap exp | — ga lz — 818] Mole), 22) 


* The equalizer‘ is capable of determining the optimum demodulating static phase, 
so that the demodulation is noncoherent only to the degree that the untracked Jitter 
degrades the error rate. 
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where M/,(-) is the moment-generating function of ¢, and 
1 : 
bs = 7, @ 9). (23) 


For the jitter density given by (21) it is easy to show that 
M4(&) = Io(A|&;]); (24) 


thus we have the familiar form 


Deptt 2 — 1 py ow 
pila) = sar exp | — gay. D4, 8°) | (25) 
where 


DX(z, 8) = liz ~ s|* — 2Notn lo (F-1@8%|)- (26) 


It is useful to summarize our work up to this juncture. Using the 
simple model (4) and the phase-error densities (7) and (21), we have 
derived the conditional density of the demodulated two-tuple with 
and without a phase-locked loop. Equations (13) and (14) and (25) 
and (26) are the desired expressions. In the next section we will use 
these densities to estimate the system error rate. 


III. ESTIMATING THE ERROR RATE 


In this section we give two estimates of the error rate: an asymptotic 
(high SNR) evaluation and an upper bound. Consider the arbitrary 
signal constellation shown in Fig. 7, where the decision regions are de- 
noted by R;. The detector, which is specified by the decision regions, 
will declare that s“ has been transmitted if and only if the demodu- 
lated vector z falls inside R;. Because of various practical considera- 
tions, principally ease of implementation, the mathematically optimum 
detector will not always be the one which is built. 


3.1 Asymptotic (High SNR) Error Rate 
The probability of error is given by 


M 
Pe. = Di piP ei, (27) 
j=l 


where the p,’s are the (taken to be equal) a priorz probabilities and P.; 
is the conditional error rate. The conditional error rate is just the prob- 
ability that z falls outside R; when s‘ is transmitted, 1.e., 


P.; = Pr{z €& R;|s™ transmitted ]. (28) 


TWO-DIMENSIONAL SIGNAL CONSTELLATION 941 


e Sk 
_ DECISION REGION 
a BOUNDARY 
es, 
we MIN. \ 
DISTANCE \r 
4 (LARGE 
PHASE 
| JITTER) 


\ 


— = 

Sm 

MINIMUM DISTANCE 
| (NO PHASE JITTER) / 


\“ 


Fig. 7—Typical decision region about signal point s;, with minimum distances to 
boundary shown for Gaussian noise alone (no phase jitter) and for large jitter (angu- 
lar displacements highly likely). 


This quantity may be written, using the conditional density p,(z), as 


Pos = fp, Pie) (29) 


Let z; denote a point of global minimum for the function d? or D? on 
the decision region boundary and let M; be the number of times this 
minimum is achieved on the boundary. Let (u, v) denote an orthogonal 
coordinate system erected at z; with the positive u axis pointed along 
the boundary line in a clockwise direction and v pointed outside R,. 
Then, as is shown in Appendix A, for a high signal-to-noise ratio 
(No — 0) with aN» = k, the conditional error rate is given by 





MM; N — =e 
“Ov Ou? 








For the case of Gaussian noise alone (no phase jitter) d? becomes the 
ordinary Euclidean distance and the partials in (30a) are easily evalu- 
ated. In this case, with s“ the signal(s) closest to s‘, we have 


wee. gs 4 g(a 
7 2 
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and 
dd? 


a = ls = s (|| 





=z* 
z=25 


07d? 


ae = 2. 








ee 
Z=2y 


Hence, for the Gaussian case, 


MyM Is — 8 
fa Qs© — sO] Va i SNo ) (30b) 


a useful formula in its own right. 

‘In terms of the distance functions d(-) and D(-), the asymptotic 
error rate is determined by the point on the decision boundary “‘closest’’ 
to the transmitted signal. Of course, in the presence of phase jitter, 
this point will generally differ from the closest point according to 
a Euclidean measurement. In Fig. 7, the indicated point on the vertical 
boundary segment is the closest point to s; in the Euclidean sense. 
As phase jitter increases, those points with a radial coordinate nearly 
equal to that of s; becomes closer to s;. So for large phase jitter, 
the point indicated on the boundary segment above s; is the ‘‘closest”’ 
point. Thus the exponential decay in error rate is quite similar to the 
asymptotic Gaussian result since it is of the form 


exp [—dmin(j)/2N 0], (31) 


where dmin(j) is the minimum distance (measured in a non-Euclidean 
manner) to the jth decision boundary. The minimum distance can 
sometimes be obtained analytically, but most often must be obtained 
by a computer search of the boundary. . 

The minimum distance to the jth decision boundary is particularly 
easy to determine if the function d(-,-) is a metric which possesses the 
midpoint property, i.e., for each distinct pair of points s“ and s‘ 
there exists a third point z* such that 


Ld(s, 8) = d(s, z,) = d(zy, 8). 


If this is the case, let z; denote any point on the decision boundary be- 
tween s“ and s and the triangle inequality gives 


d(s, z:) + d(s™, z;) 2 d(s@,s™). (32a) 
Since for maximum-likelihood detection 


d(s™, Z;) = d(s™, Za), 
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minimizing both sides of (32a) over the decision boundary gives 


d(s, z*) > min dd(s®, s‘9)) (32b) 
iAj 


where z* is the point on the boundary closest to s“. Note that (32b) 
would provide an upper bound on (31). Since the midpoint z* clearly 
lies on the boundary, z* = z,, and (32b) is satisfied with equality, 
the minimum distance is given by 


d(s®, z*) = min 4d(s, s), (32c) 
jAi 


Since the Euclidean metric is convex, eq. (30b) could be directly 
obtained from (30a) by using (82c). 


3.2 An Upper Bound on the.Error Rate (for Small Jitter) 


In systems which use a tracking loop, an upper bound on the system 
error rate may be obtained, for small jitter, by considering Fig. 8. This 
figure shows the transmitted point s‘, the decision boundary R;, and 
several nested regions C; (defined by contours of constant probability), 
where 

Cj = {z:d*(z, 8s) S ej}. 





~, ie 
~—_ 


Fig. 8—Nested equidistance contours [distance defined by eq. (13) _] about signal 
s; in arbitrary decision region. Contour c;*, at distance d;*, defines d;* as the shortest 
distance to the boundary. 
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If we let C} denote the first contour which touches the boundary, then 
it is clear that 


P.; = Pr[z & R;|s® transmitted] S Pr [z & C3/s@] 


=f ey _ Pile) 


1 
=f hy 5} = oe exp | - aN, d*(z, s) | dz. (33) 


For small jitter, the exponent may be expanded in the first power of 
1/a, to give 


d(x, 8) = [lx — sl? — Sy (x, st (34) 


for which the contours of constancy are ellipses. Transforming the 
ellipses into circles and changing to polar coordinates enable us to inte- 


grate (33) to get 
oR (eee 
aN 


Again di}, is the minimum distance to the jth decision boundary which 
for convex polygonal decision boundaries can be determined analyti- 
cally. For high SNR, the above bound is useful up to 1.5 degrees rms 
jitter. Because of the similarity of (80a) and (35), we will use only the 
former asymptotic results in the sequel. 


Pj < p [— AD /2N 0 |. (35) 


Iv. A COMPARISON OF VARIOUS SIGNAL CONSTELLATIONS 


In the preceding section we have presented a means of evaluating the 
asymptotic (high SNR) error rate for a given signal constellation and 
detector structure. In terms of the minimum distance, measured via 
the appropriate noise/phase-jitter distance function to the jth decision 
boundary, we have 


rena... Seema _ 1 oy 
FP; 2. Di 2||s@ a s()]|| r exp | 2No dt: (36) 


The minimum distance will be obtained by a computer search of the 
decision boundary. It should be emphasized that comparisons based on 
the asymptotic error rate are not exact but rather indicate order-of- 
magnitude effects. 
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Our comparisons will be made by varying the following quantities: 


(¢) signal constellations 
(27) signal-to-noise ratios 
(az) rms jitter 
(qv) decision boundaries 

(v) phase-error density. 


Clearly the pie may be sliced several ways, so let us first say a few 
words about each of the above variables. 

(z) Szgnal constellations: For the purposes of signal evaluation we 
will consider the existing 16-point constellations QAM, 8-8, and (4, 90°) 
shown in Figs. 9a through 9c. The circular constellation (4, 90°) has 
signal points equally spaced (i.e., 90 degrees apart) on four circles. This 
large angular spacing of points on the same circle suggests that this 
constellation will be insensitive to small amounts of phase jitter: The 
ratio of outer radius to inner radius (r2/7r:) for the 8-8 constellation is 
1.59, found by Lucky! to minimize the error rate in Gaussian noise. 
8-8 is an optimized form of AM-PM in which signal points on the outer 
circle do not lie on the same radial lines as those on the inner circle. It 
offers an order-of-magnitude improvement (over AM-PM) in error rate 
in the presence of Gaussian noise. We will also consider the new circu- 
lar modulation formats 1-5—-10, 1-6-9, 5-11, shown in Fig. 10. The 
optimum ratio (r2/r1) is very close to 2 for these constellations as we 
have determined by equating the three smallest nearest-neighbor 
distances. 

(2¢) We consider peak and average SNR’s, Ppx and Payg respec- 
tively, chosen so that P, (no jitter) is <10~-5. These quantities are 





(a) (b) 


Fig. 9—Existing signal constellations: (a) quadrature amplitude modulation 
(QAM), Dpeak/Pav = 1.8; (b) modified AM-PM (8-8), Dpeak/Pav = 1.43; (c) circular 
constellation, (4, 90°) Dpeak/Pav = 1.85. 
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HOES 


(c 
Fig. 10—New signal constellations: (a) 1-5-10, Ppeak/Day = 1.42; (b) 1-6-9, 
Ppeak/ Dav = 1.525; (c) 5-11, Dpeak/ Pav = 1.31. 


defined by 
Pox 


max ||s|2/2N 
7 


1 &# ; 
Pure = TH be Ils‘ ||?/2No. 


(477) Our attention is focused on the practical range of residual 
(from a PLL) jitter of <3 degrees rms. 

(iv) We will consider both the boundaries which are optimum in 
Gaussian noise (straight lines) as well as more practical boundaries for 
the circular formats (polar wedges). 

(v) The Tikhonov density will be taken as representative of those 
systems which use a tracking loop, while the peak-to-peak density will 
be used to model the raw (untracked) phase jitter. 

The error rate curves are grouped as follows: Figure 11 shows the 
error rate vs rms residual phase jitter (Tikhonov density) under aver- 
age and peak power constraints for the six constellations described 
above. For each constraint, a Gaussian noise power (and thus an SNR) 
is assumed which places the curves in a useful operating range, and the 
recelver is presumed to use the Gaussian optimum decision region 
boundaries. These boundaries are piecewise-linear contours constructed 
from segments of perpendicular bisectors of lines joining signal points as 

-shown in Fig. 12a for the 1-5-10 constellation. This is equivalent to 
deciding in favor of the signal point closest in Euclidean distance to the 
demodulated point. Figure 12b shows a more “practical” set of deci- 
sion boundaries for the 1-5-10 constellation. Figure 11 indicates the 
immunity of the (4, 90°) constellation to small amounts of phase jitter 
at the expense of an error rate more than an order of magnitude greater 
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Fig. 11—Error rate vs jitter with a phase-locked loop: (a) average power con- 

straint; (b) peak power constraint. 




















PROBABILITY OF ERROR 


than those of the 1-5-10, 1-6-9, and QAM constellations. In practical 
operation with a tracking loop, any of these alternative constellations 
will almost always outperform the (4, 90°) constellation. 

Figure 13 shows the error rates vs SNR, again under average and 
peak power constraints, in the presence of Gaussian noise alone (no 
phase jitter) and with 1.5 degrees rms residual phase jitter in addition 





Fig. 12—Decision region boundaries for part of 1-5-10 constellation: (a) Gaussian- 
optimum decision region boundaries; (b) practical decision region boundaries. 
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20 205 21 21.5 22 22.5 23 21.5 22 22.5 23 23.5 24 24.5 
SNR IN DECIBELS 


Fig. 13—Error rate vs SNR for channels with and without phase jitter; Gaussian- 
optimum receiver is assumed to have a phase-locked loop: (a) no jitter, average power 
constraint ; (b) no jitter, peak power constraint; (c) rms jitter = 1.5 degrees, average 
power constraint; (d) rms jitter = 1.5 degrees, peak power constraint. 
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Fig. 14—Probability of error for receiver with ‘‘practical’’ decision region bound- 
aries for the 1-5-10 constellation and with phase-locked loop: (a) average power con- 
straint; (b) peak power constraint. 


to the Gaussian noise. Although some of the curves shift their relative 
positions (at least under the average power constraint) when phase 
jitter is added, the good performances of 1-5~10 and 1-6-9 are main- 
tained. QAM performs respectably and the (4, 90°) constellation comes 
in last. 

Figure 14 is similar to a reduced Fig. 11 except that the “‘practical’’ 
set of decision region boundaries is presumed for the 1-5—10 constella- 
tion. As can easily be seen, the Gaussian optimum boundaries for QAM 
are also practical boundaries, and the jitter-immune (4, 90°) constel- 
lation is shown to best advantage by presuming Gaussian optimum 
boundaries. Under the average power constraint, QAM is superior to 
1—5-10 below about 1.5 degrees rms jitter. Under the peak power con- 
straint, 1-5-10 is uniformly superior to QAM. The (4, 90°) constella- 
tion does not show an advantage until the rms jitter reaches 2.5 to 3 
degrees. 

Figure 15 is a set of error rate vs SNR plots with practical decision 
region boundaries for 1-5~10. Curves are shown for Gaussian noise 
alone (no jitter) and 1.5 degrees rms jitter. Only the average power 
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PROBABILITY OF ERROR 





SNR IN DECIBELS 


Fig. 15—Probability of error vs SNR for Gaussian-optimum receiver with “practi- 
cal’? decision region boundaries for the 1-5-10 constellation and with phase-locked 
loop: (a) no jitter, average power constraint; (b) rms jitter = 1.5 degrees, average 
power constraint. 


constraint is presumed. As before, QAM shows an advantage in the 
absence of phase jitter and still does well in the presence of moderately 
severe residual phase jitter. 

Figure 16 presents some data for receivers which do not use tracking 
loops. In this case the peak-to-peak density of eq. (22) describes the 
raw jitter. Curves are plotted vs peak jitter under average and peak 
power constraints. Some interesting features are the resistance of 1-5- 
10 up to a threshold. of about 8 degrees peak-to-peak jitter and the 
rapid deterioration of the performance of QAM. | 
_ Figure 17 shows the performance vs SNR for the receivers without 
tracking loops when. the peak-to-peak jitter is 12 degrees. This is the 
only instance for which the (4, 90°) constellation looks relatively good, 
but here, too, the 1-6-9 constellation performs slightly better. QAM, of 
course, does rather poorly. ee _ 

The advantage of using a phase-locked tracking loop with QAM and 
1-5-10 can be seen from the above data and a simple calculation. If 
the raw phase jitter is modeled by 


Q(t) = A cos [wt + ¥], 
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Fig. 16—Probability of error vs. peak-to-peak jitter for Gaussian-optimum receiver 
without a phase-locked loop: (a) average power constraint; (b) peak power constraint. 


where y is uniformly distributed from —7 to 7 and 2A is the peak-to- 
peak jitter, then a rule of thumb® suggests that the residual rms jitter 
out of a tracking loop is of the order of 0.1 X 2A. For A = 6 degrees, 
this rms value is 1.2 degrees. A comparison of the curves of Figs. 11 and 
16 shows that substantially lower error rates are achieved by the re- 
ceiver with a tracking loop. The performance of constellation (4, 90°) 
is relatively unaffected by the introduction of a tracking loop. 

A further conclusion that can be drawn from the numerical data is 
that the QAM constellation, which is simple to generate and to de- 
modulate, performs quite well in Gaussian noise alone or (with the aid 
of tracking loop) Gaussian noise plus phase jitter. More circular con- 
stellations, such as 1-5-10 and 1-6-9, appear to offer a moderate fur- 
ther advantage at the expense of greater complexity. 


V. OPTIMUM SIGNAL CONSTELLATIONS UNDER A PEAK POWER CONSTRAINT 


In this section we discretize the received signal space to obtain a 
tractable optimization problem. The discretizing is such that the JZ 
signal points are selected from a circle containing Z points while the 
received points lie in a circle of N (N > LZ) points (note: M < L). We 
make the following two comments concerning this approach to solving 
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PROBABILITY OF ERROR 
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Fig. 17—Probability of error vs SNR for Gaussian-optimum receiver without a 
phase-locked loop: (a) peak-to-peak jitter = 12 degrees, average power constraint; 
(b) peak-to-peak jitter = 12 degrees, peak power constraint. 


the problem: 


(z) the level of discretization must be fine enough to provide a good 
approximation to the continuous problem, and 

(22) the outer radius must be chosen so that for all practical purposes 
the probability that a received point lies outside the outer circle is 
negligible. The peak power constraint simply means that no signal 
points can be selected outside the L circle. 


5.1 Discrete Maximum-Likelthood Formulation 


If we denote the received point, z, by “2”? and the transmitted signal, 


s”, by “7,” then the perturbation of the transmitted signal due to 
Gaussian noise and phase jitter may be summarized by a transition 
matrix whose elements are defined by 


p(i|7) = Pr [receiving ‘2’’| transmitting ‘7’ ] 
PS 40 coun. (37) 
9 = 1, 2; --- MM. 


The transition probabilities may be computed by integrating the condi- 
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tional densities p;(z) over an appropriate region. It is convenient to 


work with the maximum-likelihood receiver (i.e., the optimum decision 
66,999 


boundaries are used) which receives ‘2’ and declares that ‘‘¢,;’ was 
transmitted, where 
pilti) > pt), 7A 7,46=1,2,---M. (38) 


Note that we have (temporarily) fixed the M points in the signal con- 
stellation. It is easy to see that the probability of being correct is given 
by 


N 
Pr [correct ] = >- Pr [correct|receive ‘2’ ] Pr [receive ‘2’ ], (39) 
isl 


but by Bayes rule 
Pr [correct | receive ‘’’ ] 
= Pr [send ‘“é;’’| receive ‘7’? ] 
_ Pr [receive ‘2’ |send “‘é;’ ] Pr [send ‘2’ ] 
Pr [receive ‘‘2’’ ] 


Substituting (40) in (39) and recalling that the transmitted signals 
are equiprobable gives 


(40) 


N 
Pr [correct ] = a > Pr [receive ‘2’ |send ‘“¢;’’ ] 
i=1 


| ee 
= iL Pla), (41) 


and the optimum constellation is the M signals (or columns) that 
maximize 


N 
X p(ild). (42) 


Note that since p(z|¢:) is the maximum entry in the ith row of the 
transition matrix, the problem is one of selecting M out of L columns 
such that the sum of the row maxima is maximized. The error rate may 
be determined from (41). 


5.2 Optimum Constellations 


A heuristic program for solving the combinational optimization 
problem posed by (42) has been developed by Kernighan and Lin.°® 
Their process is based upon iterative improvement of either known 
initial constellations or random initial starts. For each start, a “locally 
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optimum”’ solution is found in the sense that no change of position of 
a single signal can improve the criterion. The heuristic process is very 
fast and, for the resolution we require, 20 random starts can be pursued 
to completion in 25 seconds. For particular values of rms phase jitter 
and noise power we find, among the best of 20 local optima, reaffirma- 
tion of known solutions and in some instances new competitive con- 
stellations. For example, for a peak signal energy of SNR = 22 dB, 
Figs. 18, 19, and 20 give the best among the 20 local optima for an 
rms of 0, 1.5, and 3 degrees, respectively. As expected, the 0-degree 
solution has a 5-11 character and the 1.5-degree solution has a 1-5-10 
character. On the other hand, the 3-degree solution is somewhat of 
a surprise; it has a 1-6-9 character. The (4, 90°) constellation, which is 


e e  ] e . 
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Fig. 18—Optimum signal constellation: Peak SNR = 27 dB, no jitter (courtesy of 
B. W: Kernighan and S. Lin). 
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Fig. 19—Optimum signal constellation: Peak SNR = 27 dB; 1.5 degrees rms jitter 
(courtesy of B. W. Kernighan and 8. Lin). 


best at 3 degrees among heretofore considered designs, has an error 
rate only a few percent worse than that of the 1-6-9 constellation: 

A byproduct of the development of the above procedure is the 
demonstration of the fact that numerical quadrature routines offer 
a competitive alternative to asymptotic techniques and bounding 
methods for the estimation of system error rates. 


VI. CONCLUSIONS 


Comparisons have been made of several well-known two-dimen- 
sional signal formats in the presence of Gaussian noise and phase 
jitter, at high signal-to-noise ratios and under both peak and average 
power constraints. It has been demonstrated that, under an average 
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Fig. 20—Optimum signal constellation: Peak SNR = 27 dB, 3 degrees rms jitter 
(courtesy of B. W. Kernighan and S. Lin). 


power constraint for systems which have a high-quality phase-locked 
loop (rms residual jitter <1 degree), QAM had the lowest error rate 
of all candidate constellations. If the residual jitter is <1.5 degrees 
rms, the 1-5-10 constellation becomes extremely attractive since it is 
immune to phase jitter in this range and provides the same asymptotic 
(no-jitter) error rate as QAM. For small amounts of jitter, 1-5-10 and 
QAM have a 2-dB SNR advantage over the (4, 90°) constellation which 
is “immune” to phase jitter. Both these constellations offer a 0.5-to-1- 
dB advantage in SNR over conventional AM/PM signaling techniques. 
Thus, under an average power constraint, both QAM and 1—5-10 merit 
consideration. 

For a peak power constraint, in addition to making comparisons 
similar to the above, we have been able to attain the optimum signal 
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constellations for various levels of jitter. Our comparisons indicate 
that, for jitter <1 degree, QAM suffers a 1.5-dB SNR penalty with 
respect to the 5-11 constellation, while 1—-5-10 suffers a 0.1-dB penalty. 
At 1.5 degrees rms jitter, 1-5-10 again is superior to both QAM and 
5-11 (by 4 and 1 dB respectively). 

Based upon the peak and average power constraints, the new modu- 
lation format 1-5-10 appears to make extremely efficient use of avail- 
able signal power and for a slight increase in modulation/demodula- 
tion complexity offers considerable immunity to moderate (<1.5 
degrees rms) residual phase jitter. QAM systems which employ high- 
quality phase-locked loops will also be operating very efficiently, pro- 
vided that the residual phase error is <0.8 degree rms. 
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APPENDIX A 
Method of Laplace 


Let g(x) and h(x) be continuous real functions on [a, b | where h(x) 
is also twice continuously differentiable. Then, if A(z) attains a single 
maxima atc (a < c¢ < b), we have that 


b —= 
[ g(xyeUORodg ~ g(c)e le aa) Tae (k > 0) 


(read ~ as asymptotic to). The proof is not difficult and the key steps 
can be found in Papoulis” or in Jones." This analysis technique for 
estimating an integral for large values of the parameter k is called the 
method of Laplace. 

An immediate application of this method used in the body of this 
paper is 





e& 


V270 


For estimating system error rates, a certain two-dimensional version 
of Laplace’s method is needed. Particularly, we shall investigate the 
following two-dimensional integral: 


4 oy) exp { jMa) ae [z = (u,v) ] 





(ao), 


Ia) 4 = [i exe ea9 ~ 
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for small k. The function h(z) we shall be concerned with is assumed 
to have the following properties: . 


(t) h(z) is twice continuously differentiable such that h,(0) < 0, 
hu(0) ¥ 0, and 0 is the unique point of maximum for h(z) in 
v 2 0.* Thus we also assume hy» of (0) < 0. 
Let G, = {uv 20,wW +0? $7} 
(zt) For some r > 0, 


tim ff exp {em} /1 3. 


e(1/k)h (0) al exp § [h(z) — (0) ; 


{v=} 


Rewriting J as 


it is easy to conclude that for each e > 0 


Fas etna f” i: exp 1; [h(z) — wo) }} 


We shall proceed to integrate with the exponent in the integrand re- 
placed by its local representation 


gc nco [ exp 1; [h,(O)v + hunw(O)u2/2 + haa(Oue]} dudu. 


The absence of h.(0) and h,,(0) follow directly from (7). Integrating 
(dv) we get 


i 
I~ ecnrmne {* pare? 1; [h.(O)e + hus(O)ue]} —1 


MK elk (huw) (u2/2) Gay, 


We appeal to the first paragraph to integrate each term involved in 
this subtraction. Take « small enough to avoid the singularity at 
u = —h,(0)/h»(0). The first term is asymptotic to 


—27 1 h2 
3/2 ~1 (0) + hi(O)e — 2H 2 
Ieaal0) few i i ) + ho(O)e hs i} 
*In this appendix, subscripts are used to denote partial derivatives, @.g., 


oh 








(u, v)=(0, 0) 


TWO-DIMENSIONAL SIGNAL CONSTELLATION 959 


while the second is 
k3/2 —2Qq7r 


AT ph sk, 


~ hy(0) Vhuu(0) 
Since for e small enough h,(0)e — [h2,(0)/huu(O0) Je? < 0, we conclude 


— [3/2 es 
Ke | 2m ghiorie, 


~ 20) Viva) 

In error rate computations one is often integrating over the exterior 
of a convex polygon. In the body of this paper we encounter the case 
where h(z) has a finite number of global maxima on the boundary, at 
most one on each side, and none at the vertices. Let z; be the nth local 
maximum. Map the exterior half-space containing z, into the upper- 
half plane via a rotation, composed with a translation taking z, — 0. 
Then the method of the last paragraph can be applied. The process is 
repeated for each maxima and the results sum to the required asymp- 
totic estimate of the exterior integral. The fact that the exterior half- 
spaces containing distinct points of maxima may overlap is of no conse- 
quence. Furthermore, in our applications, h(z) is symmetric with re- 
spect to each z; and the process need only be completed once and the 
answer multiplied by the multiplicity of the maxima. 


I 


APPENDIX B 
The Nature of d(x, y) 
B.1 Introduction 


Let V be a vector space endowed with a scalar product (x, y) and 
a norm derived therefrom. It is not known whether d,(x, y): V X V > 
|R| defined by 


dy(x, y) = ([xll? + [lyll? + 2 — 2 v|lx|PFllyll? + 2x, y) + ¥7 14? 


is a metric for any values of y (0 < y < «). Nor is it known whether 
d.,(x, y) is a metric in any sphere centered about the origin. Similarly, 
the status of the midpoint property is also unknown to us. However, 
from Ref. 8, we know that these two properties cannot hold simultane- 
ously for any y in any sphere about the origin. 

Concerning the metric question, the difficulty (as usual) is the 
triangle inequality. By definition d,(x, y) is symmetric in its arguments. 
Positivity follows easily from the Schwartz inequality, 


d(x, y) = {\lx — yll? + 2(x, y) + 277} 
— 2{-vilxllilyl]? + 2yxiillyl] +71} = | Ill — llyll | 2 0. 
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Notice d,(x, y) > 0 unless « = y. If d,(a, y) is a metric on V X V for 
a particular value of y then it is a metric for all y (0 < y < ~); this is 


easily obtained by employing the mapping z > | Vy|z. 


B.2 One-Dimensional Case 


In the special case V = R we can show d,(z, y) is a metric. In one 
dimension we have (the “1” subscript will now be suppressed) 


dla y= {ee zy +120 
WV iW@ty +2] syt1so 


To show the triangle inequality, first notice that, if three points a, b, 
and c are on the same side of zero, the distances are all Euclidean. So 
for the remaining cases to be investigated we assume one point has 
a different sign than the other two. Notice d(x, y) = d(—x, —y) so we 
lose no generality by assuming c = b = 0 2 a. Two subcases remain: 
(¢) only d(c, a) is non-Euclidean; (77) d(c,a) and d(b,a) are non- 
Euclidean. For (2) the distances involved are (c — b), (b — a), and 
[L(a + c)? + 2?]/2|. Now |[(@ + c)? + 2?]!?| < c¢ — asince, by squar- 
ing, this is equivalent to 1 + ac <$ 0. To show 


(c — b) S$ (b— a) + |[a +c)? + 2? }?| 
(6—a) S$ (¢c— 6) + |[@+eo)? + 2°}"|, 


it is enough to show 
[(e + a) — 2b]? S [(a + c)? + 27] 

since squaring both sides when the right-hand side is positive can only 
weaken the inequality. The last inequality can be simplified to 
b? — bc S 1 + ab for which the left-hand side is negative and the right- 
hand side is not. On the other hand, (27) is immediate since c — }, 
v(a +c)? + 4, and v(b + a)? + 4 can be identified as sides of a tri- 
angle with apex (—a, 2) and base points c and b. Notice d does not 
have the midpoint property since d(10, —10) = 2, yet the only points 
y satisfying d(10, y) = 1 are 9 and 11, but both d(9, —10) 
d(11, —10) exceed 2. 

This last observation shows that the open spheres in this metric 
space are not all connected. In two dimensions, the boundaries of cer- 
tain open spheres are disconnected; specifically, it can be shown that 
for certain values of g > 0 


and 


tyld(x, y) = 9} 
is a disconnected set if |ly||?y > 1. 
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B.3 Approximating d,(x, y) on the Unit Circle 


As mentioned in the text, an important open question is whether 
d,(x, y) can be accurately approximated by a convex metric. There 
are, of course, many ways in which one can approximate d,(z, y). In 
this section we dispose of two approximations which suggest themselves 
immediately. 

Let us view d,(x, y) on the unit circle. Notice, as y — 0, 


[ Vilx|[llyll? + 2y-Mx, y) + v3 | 
is die (1 + (x, y) + y? (ee ery). 
Hence, for small phase jitter, 
d(x, y) ~ ||x — yll? — y|[xll?lly||? sin? 4, 
where @ is the angle between x and y, or what is the same, 


A*(x, y) \¥? 
Ix — sit) 


where A(x, y) is the area of the triangle Oxy. Let h., denote the altitude 
of Oxy perpendicular to xy; then 


d(x, y) = |x — yl]. — yhz,)¥?. 


d(x, y) ~ |x — yl (1 Wy 


Since h S 1, 
Ix — yll|(1 — yhi,)*? 2 0 


with equality if and only if x = y so long as y < 1; also, the left-hand 
side is symmetric. Notice 


A(x, y) = |x — yl|(. — hz,)1? 


has the midpoint property. Indeed, if z lies on the line segment joining 


x and y, 
d(x, Z) d(z, y) = d(x, y) 


since Aa: = Ray = hyz. 

So we are strongly motivated now to see if A(x, y) is a metric on the 
unit disc. Note that if F is any finite subset of the unit disc, then for 
y sufficiently small, d(x, y) is a metric on F. This follows from the fact 
that for noncolinear triples, the triangle inequality holds properly for 
the metric |/x — y|]. Perhaps surprisingly, the triangle inequality for 
A,(x, y) does not hold for all triplets in the unit disc. This was dis- 
covered by linearizing A(x, y) to get 6(x, y) = |x — yl|(1 — y/2h2,) 
which is likewise a valid approximation for d(x, y) for small y. 
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For 6(x, y), the midpoint property and all requirements for a metric 
hold except for the triangle inequality. The triangle inequality question 
is easier for us to investigate for 6(x, y) than for A(x, y) as it de- 
velops into a geometrical extremal problem which we can handle. In 
solving the extremal problem, a class of vector triplets in the unit disc 
emerge for which no value of y > 0 exists such that the triangle in- 
equality holds uniformly for the class. Although these triplets arise in 
the analysis of 6(x, y), they serve just as ruinously for A(x, y). 

In order to discuss this geometric extremal problem, we require some 
notation. Suppose we have a triangle (nontrivial—positive area) in the 
unit disc with side lengths a, b, and c and opposing vertices A, B, and 
C, respectively. Let ha, hz, and h, denote the distance from the origin 
to the line containing the side of length a, b, and c, respectively. 

Write the “triangle inequality’ expression for 6(x, y): 


Ix yl (1-$0%) & Ix — zl (1 $28) 
+ lz —yll (1 ~ 578): 


Changing to the notation just introduced, we have directly that the 
triangle inequality holds for 0 S$ y S$ y where y = inf I(a, }, c). 


a+tb—e 
ra, 4 | tab 


and the infimum is over all triangles for which the bracketed expression 
is positive. This geometric extermal problem is disposed of by substitut- 
ing a triangle of the form depicted in Fig. 21. It follows easily 
lim,.o lr = 0. Thus y = 0 and the only d(x, y) metric is the obvious 
one. A straightforward substitution of the three vectors depicted 
above into the “triangle inequality” for A(x, y) yields again that for 
no fixed y > 0 can the triangle inequality hold for this family of 
triangles. 

Strikingly, for the class of triplets in Fig. 21, itis easily demonstrated, 
via substitution, that the triangle inequality for d,(z, y) holds for an 
open interval including y = 0! 


B.4 A Metric on the Boundary of the Disc 


We end on a positive note: 6(z, y) is a metric on the boundary of the 
unit disc for a nontrivial interval [0, y].* It is enough to show inf 


* We anticipate applications in phase-modulation systems. 
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Fig. 21—Infimizing family of vectors. 


I'(a, b, c) > 0 where the infimum is over those triangles circumscribed 
by the unit circle for which I is positive. Now h? = 1 — a?/4 and 
similarly for hy and h, so 


_ ea? eb) 20) Vs 
rato = (1-75 GS) 
Dividing gives 


= _ 1a. 2 2 _ __ sabe - 
(a,b, ) = (1 i|2 +b? + ¢ ab + ac + be Peg ae 
So y > 0 if and only if 
abc 


@fb-e~ 


sup 
At this point, we must digress and recall some plane geometry from 
Ref. 12. First abc/4 is the area of the triangle and 2“'(a + b + c) is 
called a semiperimeter. Given any triangle ABC, extend the two lines 
emanating from the apex A. Construct bisectors to the two exterior 
angles complementary to the angles B and C respectively. The bisec- 
tors meet in a point equidistant from the three lines containing a, b, 
and c. The circle tangent to these three lines is called an excircle. See 
Fig. 22. The center of the excircle (where the bisectors meet) is called 
the excenter and of course the radius is called the exradius. Each tri- 
angle has three excircles. Finally from Ref. 12 we need: 
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Cc 


a b 





Fig. 22—Excircle tangent to c. 


UPPER BOUND 
ON EXRADII 





| 
| 
| 
| 
| 
| 
| 
a 


Fig. 23—Upper exradius bound for excircles tangent to c. Since 0 Sc S 2, the 
exradii are uniformly bounded. 
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Theorem: An exradius of a triangle is equal to the ratio of the area to the 
difference between the semiperimeter and the side to which the excircle 1s 
tangent internally. 


So if the set of exradii of triangles circumscribed by the unit circle is 
uniformly bounded away from infinity, then y > 0 and {6,}o<y<y are 
metrics. 

To complete the proof, consider any triangle with vertices on the 
unit disc boundary. With an isometric transformation of the circle into 
itself, we can situate c horizontally with apex C above c in the left 
half-plane. The more obtuse the exterior angles at A and B, the larger 
the excircle tangent to c. So take a horizontal line a’ through B and a 
line b’ going through A and (0, 1) and replace a and 6b with a’ and 0b’. 
Clearly, bisectors constructed using a’ and b’ will intersect at a point 
further away from c than the excenter of any excircle tangent to c. By 
similar triangles, the primed bisectors intersect at a distance 
2(1 + v1 — c?/4) from c where the sign is plus if c lies in the lower 
half-plane and negative otherwise (Fig. 23). 
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A Theory of Traffic-Measurement Errors 
for Loss Systems With Renewal Input 


By 8S. R. NEAL and A. KUCZURA 
(Manuscript received December 4, 1972) 


A theory of traffic-measurement errors for loss systems with renewal 
input ts developed. The results provide an accurate approximation for the 
variance of any differentiable function of one or more of the following basic 
traffic measurements taken during a given time interval: 

(t) The total number of attempts (peg count) 
(12) The number of unsuccessful attempts (overflow count) 

(ziz) The usage based on discrete samples (TUR measurement) or on 

continuous scan. 

The approximation is given in terms of the individual variances and 
covariance functions of the three measurements. Asymptotic approxima- 
tions for these moments are obtained using the concept of a generalized re- 
newal process, and are shown to be sufficiently accurate for telephone 
traffic-engineering purposes. 

As an application of the theory, we examine the variances of the standard 
estimates of the load and peakedness (vartance-to-mean ratio) of an input 
traffic stream for a time interval of one hour. Other possible applications to 
Bell System trunking problems are discussed. 


I. INTRODUCTION 


In the Bell System, there are a number of traffic measurements 
which can be made on any given trunk group. For a standard time 
interval (0, ¢] of one hour, the three most important measurements are: 


(t) A(é), the number of attempts (peg count); 
(iz) O(t), the number of unsuccessful attempts (overflow count); 
and 
(ciz) La(t), an estimate of usage based on 36 discrete samples (TUR 
measurement). 


When all three measurements are available, several statistics can be 
formed to estimate traffic parameters of interest. For instance, the 
967 
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ratio O(t)/A(#) is an estimate of call congestion. Two other important 
parameters are the peakedness (variance-to-mean ratio) and the load 
of the input traffic. An estimate of the load is given by the function 


. _ Lal(t)/36 
oe O(8)’ 
= Ta 


while an estimate of peakedness is a complicated function of A(t), 
O(t), and La(t) which is usually obtained by iteration using the Equiv- 
alent Random method.! 

Since the trunk-engineering procedures are based on such estimates, 
it is important to know their statistical accuracy. For instance, it 
would be useful to know the error inherent in a prediction of the re- 
quired size of a trunk group (to obtain a specified grade of service) 
based on the estimates of offered load and peakedness of the input 
traffic. Such a result could be used to determine the number of single- 
hour measurements necessary to ensure a desired accuracy in the pre- 
diction, to determine the optimum number of measurements from a 
cost-effectiveness point of view, or to evaluate the consequences for 
trunk provisioning of using a given number of measurements. 

Many results concerning the accuracy of the individual traffic mea- 
surements (2) through (777) have been obtained previously, but most of 
these have assumed the arrival process to be Poisson. For example, 
assuming Poisson arrivals, the variance of the usage measurement 
La(t) was obtained by Bene§&,? and the variance of the measured call- 
congestion O(t)/A(t) was given by Descloux.® More recently, the vari- 
ance of O(¢)/A(t) was obtained for arbitrary renewal input by Kuczura 
and Neal.* The variances of some nonstandard traffic counts were 
considered by Descloux,® and numerical results were obtained for the 
case of Poisson input. 

Using the concept of a multidimensional renewal process, we develop 
a general theory of errors which provides an estimate of the variance of 
any differentiable function of the measurements (7) through (777). Con- 
sequently, our results can be used to answer many questions similar to 
those mentioned above. The variances of the estimates of offered load 
and peakedness of the input traffic will be derived in Section IV as 
examples which illustrate our general theory. 

Section II contains the derivation of an approximation for the vari- 
ance of a function of the three traffic measurements. The approxima- 
tion is given in terms of the individual variances of (2), (2), and (272) 
and the covariance functions between them. 
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Section III contains the mathematical model used to derive the vari- 
ances and covariances. Section V contains a summary and an outline 
of other possible applications. 


II. STANDARD ERRORS OF FUNCTIONS OF RANDOM VARIABLES 


For completeness, we present those results from the theory of stan- 
dard errors® which are required below. Let &, &, and & be random 
variables and g a real-valued function. Assume that £ has a mean 
6:, g(&1, &2, 3) has finite mean and variance, and g is differentiable at 
the point (61, 62, 63). Using a Taylor series expansion we have, to first 
order, 


3 
g( Es, fa, &2) — g(Ba, 82, 62) ~ D ( — 65) SE. (1) 


We are assuming that the observation period will be sufficiently large 
so that the contribution of the higher-order terms can be neglected. 
This assumption will be justified in our model. 

Taking the expectation of (1), we see that the mean value of g is 
approximately g(@1, 92, 63) since ELé; — 6;] = 0. We also have 


Var [o(é1, £2, £3) ] 
= ~ 9) 99 |? 
aun > (Es: — 69) 56, | 


_ys (9 dg ag 

7 2 (3) Var [éi] + 235 30, 00> Cov [éi, &2] 
og og 
00; 003 


dg og 


06 003 Cov L&s, £3 ]- (2) 


Hee ee COV Leyes ee 
After setting &; = A(t)/t, & = O(t)/t, and &; = La(t)/36 the above re- 
lation becomes the starting point for our theory of traffic-measurement 
errors. It approximates the variance of any differentiable function of 
the measurements in terms of their variances and covariances. 

In the next section we derive expressions for the required moments. 
These, together with the first partial derivatives of g, approximately 
determine the variance of g. If the function g is too complicated to be 
differentiated analytically, differencing may be used to approximate the 
partial derivatives. An example of this procedure is given in Section IV 
where we discuss the variance of an estimate of the peakedness of a 
stream of offered traffic. 


970 THE BELL SYSTEM TECHNICAL JOURNAL, JULY-AUGUST 1973 


WI. MATHEMATICAL MODEL 


Consider a system of c servers serving customers whose arrival 
epochs constitute a renewal process. We assume that the interarrival 
times are independent and identically distributed according to the dis- 
tribution F having mean 1/\, and that the service times are in- 
dependent and identically distributed according to a negative-exponen- 
tial distribution with unit mean. If all servers are occupied when a 
customer arrives, he leaves and has no further effect on the system. If 
an idle server is available when a customer arrives, service begins 
immediately. 

Let (0, ¢] denote a time interval of length ¢ which commences at a 
stationary point for the arrival process.* (Such a point is often said to 
be chosen at random on the time axis.) Let A(t) be the number of 
arrivals and O(t) the number of blocked attempts in (0, ¢]. Finally, let 


iGe= J “C(w)du, (3) 


where C(u) is the number of busy servers at time u, be the total usage 
in (0, t]. Note that L(t)/t is a continuous-scan estimate of the carried 
load. 

As was pointed out in Section II, the individual first two moments 
of A(t), Of), and La(t) and the corresponding three covariance func- 
tions are sufficient to obtain an estimate of the variance of any func- 
tion of these measurements. By numerical experimentation, we 
found that the covariance functions Cov[_A(t)/t, La(t)/36] and 
Cov[O(t)/t, La(t)/36]| are, for our purposes, well approximated by 
Cov [A(t)/t, L(t)/t] and Cov [O(¢)/t, L(t)/t], respectively. However, 
the variance of L(t) can be significantly smaller than the variance of 
L(t) so that, in general, we must use Var [Za(¢) ] in our applications. 

In the next two sections, we derive the individual and joint moments 
of A(t), O(t), and L(t). In Section 3.3, we obtain the variance of L,(t). 


3.1 A Multidimensional Renewal Process 


Assume that the system described above is in statistical equilibrium’, 
let tn, n = 0, 1, 2, ---, be the instant of time at which the nth overflow 


*That is, the time until the first arrival after ¢ = 0 has the remaining life-time 
distribution H(t) = » fot [1 — F(x) ]dz. 

t That is, the system has been in operation sufficiently long prior to ¢ = 0 so that 
system-state probability distribution at ¢ = 0 is the limiting (or stationary) dis- 
tribution P{C(0) = k} = p, = lim... P{C(u) = k}. It follows that for any t=0, 
P{C(t) = k} = pu, ie., the process {C(é), t= 0} is stationary. 
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occurs, fh <0 <t) < te < +--+, and define X, = tn — tn_1. Now let 
Kn, n = 1, 2, ---, be the number of arrivals in (¢n_1, tn ] and 


1; =[- OOD bis: He eka: 
tn-1 


be the total usage in (¢n—1, tn]. 

Since holding times are exponential and arrival epochs constitute a 
renewal process, the sequence of times t,, n = 0, 1, 2, ---, are regenera- 
tion or renewal points in our model. Hence Xn, Ky, and Jn, n = 1, 
2, --:, are sequences of independent and identically distributed 
random variables. 

If we now define the row vector 


Xn = (1, Kn, In), MAN Di: Pee, 


then it follows that {X,, X,}, n = 1, 2, ---, is a multidimensional 
renewal process.’ Moreover, setting 


where the sum is taken over all n such that 0 < ¢, S t, we see that for 
large t, 
n(t) ~ (Od), AW), L@). 


Since this formulation corresponds to the concept of a generalized 
renewal process as communicated by J. M. Hammersley in the dis- 
cussion of W. L. Smith’s paper,’ his results apply directly to our model. 
In particular, we shall use his equations (25) and (26) to compute the 
moments of 7(t). 

Let un(c) = E(X7) be the nth moment of the interoverflow times 
from a group of c servers and 


y= fear: 


For brevity, we denote the arrival intensity vy’ by \. From Ref. 4, we 
have the first moments of A(¢) and O(t) already computed: 


ELA) ] =, 


4 
HELO] = — 

From eq. (25) of Ref. 7, we have 
ELL) = (5) 





ua(c) ’ 
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where 
wr(c) = ELIT]: 


Again, from Ref. 4 and Ref. 7, the variances and covariances of the 
three measurements for large t, omitting terms which behave as o(é), 
are given by 


Var [A(t)] ~ SLs — +2], 
Varro = a Cue) — u2(c)], 


Var [L(t)] ~ 5 {u2(c)eo2(c) + w2(c)ea%(c) 
— 2ui(e)wi(c)ELXili}}, (6) 
Cov [A(), O(t)] ~ =) {rue(e) — ELKiXs]}, 


Cov [O(2), L(t)] ~ {u2(c)wr(c) — wife) EL Xi; J}, 


to 
ui(C) 
CovLA@), L)] ~ {ur(c) EC Kili] + Auo(c)or(c) 


— Ami (c) EL Xi] — o1(c) EC X1K, ]} - 


to 
ui(c) 


We now need to compute the various moments and joint moments of 
X1, Ky, and J; appearing on the right-hand side of (4), (5), and (6) 
in order to evaluate the approximate expressions for the moments and 
joint moments of A(¢), O(¢), and L(t). Note that the mean and variance 
of A(t) are known since \ = »j' and v2 are computed directly from F. 


3.2 The Joint Distribution of Ki, X1, and I, 

The development here parallels that of Section 2.2 in Ref. 4. Let 
h.(w, r, n) be the joint density function defined by 

a 


dog (41 Sw Hs, Ki=n}- 


h-(w, rn) = 
By considering the two mutually exclusive events {the cth trunk re- 


mains busy throughout (0, w)} and {its complement}, and using a 
renewal-type argument, we arrive at the following integral equation: 


n—1 w u r—v 
hilto, *, 2) = €*healeo, r = 10,0) + / i J eh, a(t, 8, k) 
k=1 0 0 0 


Xh(w—u,r—s—v,n—k)dsdvdu, (7) 
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in which the time variables u and v run concurrently from an overflow 
epoch. 
The following boundary conditions hold: 


h(w,r,n) =0, for r>cw or r<0O, 
hw, cw, n) = e~’ f(w)61,n 


where f = F’ and 6;,, = 1 for n = 1 and is zero otherwise.* 
If we define 


v(2, ¥, 2) = s io ie e*’—uTh (w, r, n)z"drdw , (8) 
n=1/ 0 0 


then it follows from (7) that 
(y ete 1)yc—1(X =f y on 1, Y; z) 
A eee ss——sss:—«SOC 9 
WH) TTT oye ye) traetytiye © 


Motivated by the work of Riordan’ and the success of the approach 
taken in Ref. 4, we set 


(y + 1)D.(x, y, 2) 


; 10 
D.41(2, Y, z) ( ) 


ye(z, UP z) = 


where D)(x, y, z) = 1, and, as can be seen by setting c = 0 in (8), 


y tl 


D(a, Y; z) = 2o(x) ’ 


(11) 
where 
o(a) =f ewar(y- 
Furthermore, for m = 1, 
Din+i(X, Y; 2) 
= Daa, v, 2) + | 


If we now define 


yt dt 
z(x) 





1] Dale+y+ ty): (12) 

poe tos Ue Ai 
zo(a + jy + 9)’ 

then using (11) and (12) and mathematical induction one can show that 


Dalat, 2) = 1+ (- »i( Wakes Asa (14) 


Aj = A,(z, Y; z) =] (13) 


“In our model we assume that the interarrival-time probability distribution func- 
tion is differentiable. However, with more formalism, the same results can be obtained 
for the more general case; e.g., the one-point distribution function for constant 
interarrival times. 
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Now, since 


+7+k 
ELX{H KY] = (-1)* eagles Ye(2, Y, 2) z==0 
for k = 0, 1 and7z, 7 = 0, we can compute the required moments di- 
rectly by means of differentiation. Omitting all of the details of the 
operations indicated, we obtain 


u1(C) = »,D, 
w1(C) = DD; — 1. (15) 
where 
Ramen ee = (= yi(‘ ) Moda es 
with 
Cs ere 
‘ o(k + 1) 


Note that D, is the reciprocal of the generalized Erlang-B blocking 
probability B.. Moreover, with the aid of the results obtained in Ref. 
4, we have 


w0(o) = 2Lox(c) +1] L Lonts) + 1] — 20De + DY, 


wo(e) = 2 ar(o) + Qur(e) Do walle) ~ 2nsDE, (16) 
where 
DY = OTN D (x 2) . (17) 
c  Oxtaytdek oS? aes a 





The required derivatives are given by 


Dem = ¥ (11 (6) dota ean + Be os + Se], 
j=1 Aj-1 








OQ OM 7 

DY = Oe Ore [pe i) eo 
© = (-) € a ra ay [2+ Bh ve ot | 
where is the derivative of a 0, 1) evaluated at x = 1, ie., 
k 2 p) 
~ ri $2(k + 1) 
and Q, is the derivative of \,(1, y, 1) evaluated at y = 0, i-e., 
ko’(k + 1) 1 


oe ed) ae) 
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Similarly, we have the joint moments 


ELXih] = WwLor(e) + 1] Y Cox(9) +1) 
= 2. (1 + v1) D2 a v1 D010 
EL KyX1] = u(c) + ~ (0) + = pak) + DP? — DE, (18) 


ELKyl1] = 2fer(c) + 1] > [or(e) + 1] 


@1 — » (—1)5( © shiplieas |edit ee ccs 1 _ i. 
pom = FI (F) Aetenrt | tg to tS a 





3.3 Variance of Discrete-Scan Estimate of Usage 


Our present mathematical model assumes that the measurement of 
usage, L(t), is made by means of continuous scanning, as can be seen 
from the definition of L(t) in eq. (3). In practice, however, usage is 
estimated by discrete scanning. The number of busy trunks is sampled 
at constant intervals of time, say 7, and the integral in (3) is replaced 
by the finite sum 


Lat) = ¥ Oke), (19) 


where 
n(t) = max (k:kr S 2). 

This procedure introduces a sampling error in the evaluation of the 
integral. As we shall see later, the difference between the variances of 
L(t) and L(t) can sometimes be large enough that the discrete-scan 
variance of usage must be used to estimate accurately the variance of 
g in eq. (2). In this section we indicate how the variance of L(t) is 
computed. 

Let n = n(t) be the number of discrete samples in (0, ¢]. In trunking 
applications, ¢ is usually taken to be one hour (about 20 mean holding 
times) and, since 7 is normally set at 100 seconds, n = 36. Since the 
process {C(t), t 2 0} is stationary, from (19) we have 


Var [La(t)] = nR(0) + 2 (nm — AR(n), (20) 


where # is the covariance function defined by 


R(t) = ELCO)C@ ] — ELC(0) JELCH) J. (21) 
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Since P{C(0) = k} = P{C(t) = k} = pi, 


RO =X kpe L JP rst) — mi, (22) 
where 
P,,(t) = P{C@) = j|C(O) = K}, (23) 
and 
m, = = kpe- (24) 


The problem of determining the transition function in (23) has been 
treated by Takacs in Chapter 4 of Ref. 9. However, he uses a renewal 
point for ¢ = 0, 1.e., his origin is chosen at a point immediately after an 
arrival has occurred. His result, though not directly applicable, can be 
modified in a straightforward manner to take account of our different 
location of the origin. We state here the analogue of his Theorem 3 for 
the case of a stationary origin and give a proof in the appendix. We 
use 1/u to denote the mean service time throughout the statement and 
proof of the theorem. 


Theorem: Let t = O be a stationary point for the arrival process described 
above. Then the Laplace transform of (23) 1s given by 


mise) =f e*Pasat = & (-5(7) Bale), (25) 


=J 


where 


; = 1 — ¢(s + tu) Pails) k 1 d(s + in) 
bl) = Seay eri ta) GEL Ser 


vu) = |) / E (law 
LE QaallS Qeawiee a 
- fe aT nee : OT cera 
C;(s) = tearm ; 
C_i(s) = 1, 




















o(s) =f evar, 
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and (s) is the Laplace-Stieltjes transform of the distribution (24), that 
18, 
- aN 
&(s) = * [1 — 99). 
Taking the Laplace transform of (22), substituting for 7;;(s), and 


simplifying, we obtain the following expression for p(s), the Laplace 
transform of the covariance function R(t): 


nN 1 — ¢(s +1) mi 
a ef-s 2 


" [1 /¢ sp aol ) ae 
+ GED a0 i G) om 
_ 2 > i [ae — (J) B, 


ja2 (8 + 7)C;(s) $(J) 
+ 1 OTS) Bias | , (26) 











$(j + 1) 
where 
B=-G2(S)e 3 ar 
ee Nt a/ & a} C;’ 
i (ty) ; 
C= a aye NY 7=1,2,-:-, 
Dow - 
Ce=d: 


and m, and mz, the first and second moments of the distribution {p;}, 
are given by® 


£ oN 
Mm, = di kpxy = —(1 — B.), 
k=1 L 
= 2 kp, = Mm, + ALB, = cB, |. 


Note that m, is the carried load and B, is the generalized Erlang-B 


blocking 
1 
B.=1 ) — 
/ %, (5 C; 


Equation (26) has been inverted analytically for the case of Poisson 
input.? When ¢(s) does not have the simple expression of this special 
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case, analytical inversion appears to be complicated. However, for the 
purpose of computing the variance of L(t) for our trunking applica- 
tion, it is unnecessary to obtain an explicit inverse of p(s). We found 
that the numerical inversion scheme described by Ejisenberg?® is com- 
putationally efficient and gives satisfactory results. 

To illustrate the difference between continuous-scan and discrete- 
scan measurements for the case when the input traffic is of the overflow 
type, we computed the estimates of the variances of L(¢)/t, the con- 
tinuous-scan estimate of the carried load, and La(t)/36, the discrete- 
scan estimate, for various trunk-group sizes and t = 20 mean holding 
times-i.e., about one hour. For these results, the interarrival-time dis- 
tribution of the arriving traffic was obtained by using the Interrupted 
Poisson process with a three-moment match.!! 

The case for ten trunks is typical and is presented in Fig. 1 where 
or, = VVar [La(t)/36] and oz = VVar [L(t)/t] vs a are graphed for 
z= 1, 2, and 4. 

Since the variance of La(t)/36 must be at least as large as the vari- 
ance of L/(t)/t, our results show that the asymptotic estimate of 
Var [L(t)/t] has a small positive bias, especially for low loads. Our 
simulation results verify this observation and also indicate that the 
asymptotic approximation becomes more accurate as the input load 
increases. Notice that the variance of La(t)/36 is about equal to the 
variance of L(t)/t at low loads. As the load increases, the relative error 
introduced by discrete scanning can increase substantially. Finally, we 
found that for fixed load and peakedness, the relative difference be- 
tween Var [L,(t)/36] and Var [L(t)/t] decreases as the trunk-group 
size increases (an effect not shown in the figure). 


Iv. TWO APPLICATIONS 


We give two applications of our results, in which we obtain the ac- 
curacy of the estimates of two traffic parameters. In the first example, 
the parameter is the offered load as given in Section I. For the second 
example, we discuss an estimate of the peakedness of the offered traffic. 


4.1 Accuracy of an Estimate of Offered Load 


Suppose we have observations A(t), O(t), and La(t) recorded. Then 
for the measurement period (0, ¢], @, an estimate of the offered load (in 
erlangs), is given by 

La(t) /36 
= of AW, 00), Lal] = BOLT. (27) 
1 — A(d) 


TRAFFIC-MEASUREMENT ERRORS 979 


ANALYTICAL APPROXIMATION 


— — DISCRETE SCAN 
CONTINUOUS SCAN 


—_Q-—- ==> 
R =~ 
Yi -Q— oe 
/ / >. x ~ 
@ p =~ 
XS - 
SS 
XN 


POINTS OBTAINED BY SIMULATION 
ce) © DISCRETE SCAN 
x X CONTINUOUS SCAN 


STANDARD DEVIATION OF CARRIED — LOAD MEASUREMENTS 





OFFERED LOAD, a, IN ERLANGS 


Fig. 1—Standard deviation of carried load measurements vs offered load using 
discrete-scan and continuous-scan measurements on a 10-trunk group for ¢ = 20 
mean holding times. 


Obtaining the required derivatives of g as indicated in eq. (2), sub- 
stituting into (2), and simplifying, we have the following expression 
for the variance of the offered-load estimate in (27): 


Var [é | = BE { Var [O() ] + Var [La(t) | (a) 


+ B2 Var [A(é)] + 2 Cov [L(), Of] 
— 2B, Cov [L(t), A(t) ] — 2B. Cov LO), A(t) i}. (28) 


Now using eqs. (6), (15), (16), and (18) to substitute for the various 
quantities on the right-hand side of (28), we can compute Var [@ ]. 
To test the approximation (28), we computed the variance of @, as 
outlined above, for trunk-group sizes of c = 10 and c = 40 trunks, for 
input traffic streams of the overflow type having different combinations 
of load and peakedness values. We also used a computer simulation to 
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ANALYTICAL APPROXIMATION 
X POINTS OBTAINED BY SIMULATION 


STANDARD DEVIATION, o, , OF OFFERED — LOAD ESTIMATE 





4 8 12 16 20 
OFFERED LOAD, a, IN ERLANGS 


Fig. 2—Standard deviation of offered-load estimate vs offered load for c = 10 
trunks. The measurement interval is 20 mean holding times. 


estimate Var [@] at several points. The numerical results are displayed 
in Fig. 2 for c = 10 trunks and Fig. 3 for c = 40 trunks where 
go = VVar [&]vsqais displayed for z = 1, 2, 4, and 10 (again fort = 20 
mean holding times). 


ANALYTICAL APPROXIMATION 
X POINTS OBTAINED BY SIMULATION 


STANDARD DEVIATION, o,, OF OFFERED — 
LOAD ESTIMATE 





OFFERED LOAD, a, IN ERLANGS 


Fig. 3—Standard deviation of offered-load estimate vs offered load for c = 40 
trunks. The measurement period is 20 mean holding times. 
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The simulation results indicate that for c = 10 and c = 40 the 
asymptotic approximation for Var [é@] is quite accurate for all ranges 
of load and peakedness of interest in trunking applications. 

We obtained almost identical results for Var[@] regardless of 
whether we used Var [La(t)/36_] or Var [L(t)/t]. Apparently, for low 
loads (low blocking probability) the accuracy of & is dominated by the 
accuracy of the usage measurements while at high loads (blocking 
near 1) the accuracy of the call-congestion estimate is the dominant 
factor. Since the relative difference between Var [L.(¢)/36] and 
Var [L(t)/t] is small for low blocking probabilities, we see that the 
accuracy of @ is not significantly affected by the TUR sampling error. 


4.2 Accuracy of an Estimate of Traffic Peakedness 


When all three of the measurements A(t), O(£), and La(t) are avail- 
able for a final trunk group of ¢ trunks in an alternate-route network, 
the peakedness z of the input traffic is estimated in the following man- 
ner: First an estimate of offered load & is determined as described in 
the preceding section. Then an estimate 2 of z is obtained by iterative 
methods (using the Equivalent Random method!), such that a single 
overflow stream having load & and peakedness 2 would experience the 
call congestion O(t)/A(t) or, equivalently, the resulting carried load 
would be La(t)/36. 

Thus, there is a well-defined procedure for determining a unique 
value for 2 corresponding to A(t) = O(t) > 0 and La(t) > 0, ie., we 


have the estimate 
= 9 [40,20 20) 
t ’ t ’ 36 

in the required form. However, there is no explicit analytical expres- 
sion for g which can be used to obtain the derivatives needed in (2) to 
obtain the variance of 2. 

In such cases, it is natural to estimate the partial derivatives by first 
differences. For example, 


Og ~ gid + A), 62, 8s] — gf 1, 92, 43] 


2 
OX1| 6,,62,03 AA : (29) 


where A is a small positive number. Numerical experimentation indi- 
cated that A = 0.001 gives sufficient accuracy for the present applica- 
tion. Using the first-difference approximations as illustrated in (29) 
for the derivatives in (2) we have an estimate for the variance of 2. 
We computed the resulting approximation for c = 10 and c = 40 
trunks for a range of offered loads a, several values of peakedness z, 
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ANALYTICAL APPROXIMATION 
X POINTS OBTAINED BY SIMULATION 
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° 
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OFFERED LOAD, a, IN ERLANGS 
Fic. 4—Standard deviation of peakedness estimate vs offered load for c = 10 
trunks. The measurement interval is 20 mean holding times. 


and ¢ = 20 mean holding times. We also compared our approximation 
with results obtained by simulation. The results are displayed in Figs. 
4 (for c = 10) and 5 (for c = 40) where o, = VVar [2 | vs a is given for 
z = 1, 2,4, and 10. Note that o./z ~ 0.2 for large a, independent of c. 


ANALYTICAL APPROXIMATION 


X POINTS OBTAINED BY SIMULATION 


2.0 


= 
(=) 


ad 
a 


°o 
re) 





STANDARD DEVIATION,o,, OF PEAKEDNESS ESTIMATE 


° 
ms 


20 30 40 50 60 
OFFERED LOAD,a, IN ERLANGS 


Fig. 5—Standard deviation of peakedness estimate vs offered load for c = 40 
trunks. The measurement period is 20 mean holding times. 
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CONTINUOUS SCAN 


== -—— DISCRETE SCAN (TUR) 


STANDARD DEVIATION OF PEAKEDNESS ESTIMATE 





20 30 40 50 60 
OFFERED LOAD, a, IN ERLANGS 


Fig. 6—Comparison of the standard deviation of peakedness estimates using 
discrete-scan and continuous-scan measurements of carried load on a 40-trunk group. 


In general, the simulation results are in good agreement with the 
approximation. The curves are plotted either for a = z, or else for call- 
congestion exceeding 0.01, the range of interest for trunking applica- 
tions. When a is smaller than z or the call-congestion is much smaller 
than 0.01 (not shown in the figures), the value obtained from the ap- 
proximation for Var [2] occasionally tends to be larger than that ob- 
tained by simulation. Hence, the approximation may not be adequate 
for such applications. 

In the preceding section, we noted that essentially the same results 
were obtained for Var [@] regardless of whether we used Var [ L(t) /t] 
or Var [ La(t)/36 ] in the computations. In contrast, the variance of 2 
is very sensitive to the variance of the usage measurements. That is, 
Var [La(t)/36_] is required to obtain an accurate approximation for 
Var [2]. The errors which can result from using Var [L(¢)/é] instead 
of Var [La(é)/36 | are illustrated in Fig. 6 for the case of c = 40 trunks. 
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V. SUMMARY AND OTHER APPLICATIONS 
5.1 Summary 


We derived an approximation for the variance of any differentiable 
function of the three basic traffic measurements—namely, peg count, 
overflow count, and usage (TUR). The approximation is expressed in 
terms of the first partial derivatives or first differences of the function, 
and the individual variances and covariances of the measurements. 
Except for the variance of the TUR measurements, asymptotic ap- 
proximations for the required moments were obtained by an applica- 
tion of Hammersley’s generalized renewal theory. 

The variance of the TUR was given as a sum involving the covari- 
ance function for the number of busy servers at equally spaced scan 
intervals. The Laplace transform of the covariance function was de- 
rived and inverted numerically using an inversion technique described 
by Eisenberg. 

The results were then applied to obtain approximations for the 
variances of estimates of offered load and peakedness (variance-to- 
mean ratio) of a stream of traffic of the overflow type submitted to a 
loss system. The approximations were in good agreement with results 
obtained by simulation. 


5.2 Other Applications 
5.2.1 Offered Load Estimates Based on Usage Measurements 


At present, A(¢) and O(¢) are not always measured on primary high- 
usage trunk groups. Estimates of the single-hour offered load and call- 
congestion for such groups are obtained from the TUR measurement 
L(t) with an iterative procedure based on the Erlang-B theory. Using 
the techniques presented above, one could compare the accuracy of 
these estimates with that which would be obtained by using all three 
measurements. It should then be possible to evaluate the difference in 
statistical accuracy that results from using (or not using) the additional 
measurements. 


5.2.2 The Optimum Number of Single-Hour Measurements 


Normally, 20 single-hour measurements are used to obtain estimates 
§ of the correct number of trunks s required to obtain a specified grade 
of service. For example, on final trunk groups, the 20 single-hour esti- 
mates of call-congestion O(t)/A(t) and usage La(é) are first averaged 
and then used to estimate an average load and peakedness of the input 
traffic. These average values are used to obtain §. 
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It has been proposed that the number of measurements be reduced 
in order to lower the data handling costs. However, reducing the num- 
ber of measurements would increase the variance of §, i.e., decrease the 
accuracy of the trunk estimates. It appears that an optimum number of 
measurements could be determined by minimizing a function of the 
form 

eC(N) = K Var [S(N) ] + Cy, (30) 
where Cy is the cost of taking and processing N measurements, §(N) 
is the estimate of s based on N measurements, and X is a cost associ- 
ated with inaccurate trunk estimates. The precise form of the function 
might require modification. However, the basic idea is to trade off an 
increase in the accuracy of the provisioning process due to more ac- 
curate trunk estimates (as N increases) against a corresponding in- 
crease in cost. 

It appears that one can obtain an approximation for Var [s(N) ] 
using an extension of the ideas presented in Sections ITI and III in order 
to account for the effects of day-to-day variation in the offered loads. 
However, a realistic model to justify (30) or to obtain K and Cy will 
require further study. 


APPENDIX 


We prove here the theorem stated in Section 3.3. We shall need the 
following lemma. 


Lemma: For the model described in the text, let Y(t) be the number of busy 
servers at time t, t = 0 be a stationary point, and Y(0) = 1. Now let Yu 
be the number of busy servers found by the nth arrival and t, be the time of 
the nth arrival. For n = 1, 2,-+--, andr = 0,1, +++, ¢, define 


Ag?(s) = Bye ( 7) Y(0) = ib, (31) 


vals) = X AP. (32) 





and 


Then we have 


vio = [049 / £5) at] 


x c) 1 oo ee] 











J} Cz-1(s) (8 + ju) 


) , 
ee ‘e 1 saab It (33) 


J} Cz-1(8) O(s + Ju) 
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where 
(s) = Tr os EH) _ ee 
C;(s) = at Pie 6(s ae ip) ? J 0, 1, 2, ? 
C_1(s) = 1, 
=_ . —st 
o(s) =f" evarey, 
and 


NO ge if ke e-'dP(t) = | 7 e-*[1 — F(t) Jat 


r 
ar [1 — ¢(s) ]. 
The proof below is essentially the same as the proof of Lemma 1 of Ref. 
9, modified to account for the stationary origin. For n = 1, 2, ---, we 
have 





iE ern ( —) 
r 


en (etrn)e (/ i ) E{e*|Y, = jj, for j<c¢, 


ea (*) E{e*|Y¥, = ¢} for j=, 


¥ 5: = 7 tnti — tn = «| 


because under the given conditions Y, has a binomial distribution 
with parameters 7 + 1 (for 7 = 0,1, ---,c — 1) ore (for j = c) and 
e—#=, If we remove the condition tr41 — tn = 2, that is, multiply by 
QP {tn41 — tn S x} and integrate over all x, we obtain 


E jenn ( =o) i + 1 
Tr 


r 
for 7 = 0,1, -:-,c — 1 and 
— Yngt c = 
E 4 e7 stn : Yn =cp = o(s + 7p) . E{e-**|Y, = c}. 


If we multiply the corresponding equations by P{Y, = 7} and add 
them for 7 = 0, 1, ---, ¢, then we get 





Vise i| = ot (751) Bley. = 51, 


ASTY(s) = O(s + rp) | AP) + A? i(s) — (, i i ) A(s) | , (84) 
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forr = 1, 2, ---,c and 
A}(s) = $(s)[¢(s) ]”7. 
Since Y(0) = z and ¢ = 0 is a stationary point, we have 
Ap(s) = (*) &(s + ry). 
Forming the sum (32) we get 
ins) — (1) 66 + rw 
= $(s + ry) | Yl) + Pir-a(s) — (, oe :) #8) ‘ 


or 


_ __ o(s + ry) b(s + rp) 
VO) Ts wera) Ie Des tu) 


+ virals)— (2 1) velo) |+ (5) 
Dividing both sides of this equation by C,(s) we get 
1\ o(s + rp) _ (,< ) 
Wir(s) _ Wi,ri(8) ey () o(s + ru) Reo viele) 
C.(s) Cis) C,_1(s) 


Adding these equations over r, r — 1, r — 2, --- , 1 we obtain 


Hey < BPC GOA it # 1 
C.(8) one eet oe vee(8) = (ae 














@=1,2,---,¢. (36) 
Setting r = c in (36) we get 


w= EG) ee tae/ Gas @ 


Substituting (37) into (36) we obtain yi(s) for r = 1, 2, ---, c. If 
r = 0, then 


vals) = X $6)" 


_ ds) 
T— $6) 


This completes the proof of the lemma. 
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We now prove the theorem stated in Section 3.3. Again, the proof is 
a slight modification of the proof of Theorem 3 in Chapter 4 of Ref. 6. 
Let us define the binomial moments 


B.,(t) = 2D (7) Pal, 19 =O, 18s pe 


k= 


From the definition it follows that 


Pa) = & (-Dr# (7) Bald, (38) 


r=k 


so that setting 
Birls) = f° e~Bis(t)dt 
0 


and forming the Laplace transform of (38), we get eq. (25) of the 
theorem. It remains to determine £;;,(s). 

Let Y(t) be the number of busy servers at time ¢ and let ¢ = 0 be 
a stationary point. The times between those successive arrivals which 
find 7 servers busy are independent and identically distributed ran- 
dom variables. Hence, the sequence of epochs immediately preceding 
those arrivals which find 7 servers busy constitutes a renewal process. 
If Y(0) = 2, we will denote the renewal function of such an imbedded 
renewal process by /;;(t). 

It may be helpful to recall that 1;;(t) is the expected number of 
those calls which arrive in the time interval (0, ¢] and find exactly j 
servers busy, given that initially there are 7 servers busy. Hence, we 
can write 


Mit) = © Pits St, Yn = jlYO) = 4, (39) 


where ¢, and Y, have the same meaning as in the statement of the 
lemma. 

If no calls arrive in (0,¢], then Y(t) has the binomial distribution 
with parameters 7 and e~#‘, and B;,(¢) is given by 


(7) tt - FO 
where 


F(t) = J . (1 — F(a) Jae. 


If one or more calls arrive in (0, ¢], let the last call’s arrival epoch be 
u and at that instant let the number of busy servers be 7. Now Y(¢#) has 
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the binomial distribution with parameters 7 +1 Gf 7 = 0, 1, ---, 
c — 1) ore (if 7 = cc) and e- 4“, Thus, together we have 


B,(t) = (*) eT] — F(t) ] 


ai ( jt+ ') f ene — P(t — uv) dM,(u) 


j= r—1 r 


r 


t 
+ (*)/ erH(t—w) 1 — F(t — u) |dM;-(u). (40) 
0 
If we introduce the Laplace-Stieltjes transform 


his(8) =| e~ "dM ;;(t), 
then from (40) we have 
1 fee (fe — $(s + ru) 
(s + rp) rJ/1— o(s + rp) 


UF *) wit + (2) ae] an 


j=r-l1 


Bir(s) = 
From (39) we have 


wi(s) =X PUYa = HBlee|¥n = 3, YO) = 3} 
and hence by (31) and (32) we get 
£ ())nio = Eafem(") 
= vals), 


Thus, 6:-(s) can be written in the following form 


_1—¢(strm)[/t\1— os + 7) 
Bir(s) = stru |(*) aes 


+ verals) — (© 1) vues) |: 


Y(0) = i} 





+ Pir(s) 


If we take relation (35) into consideration, then this formula can be 
simplified to 


(3) = Last) vis) 4 (@)_ 1 fF _ det) 7, 
Bir(s) = d(s + ru) (8s + 7H) ear sem | 


This completes the proof of the theorem. 
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Switching Networks of 
Planar Shifting Arrays 


By R. S. KRUPP and L. A. TOMKO 
(Manuscript received June 2, 1972) 


An array of shift registers that may operate in two orthogonal directions 
can be called a planar shifting array. This article shows how two basic 
building blocks, fashioned from planar shifting arrays, may be inter- 
connected to form a time-division switching network of arbitrary size. The 
characteristics of magnetic bubble and charge-coupled devices are com- 
patible with the concept of planar arrays, and it 1s in these emerging 
technologies that switching networks of planar shifting arrays may become 
practical. 


I, INTRODUCTION 


Switching machines in the Bell System have grown in both number 
and capacity to meet the growing traffic demand. Early machines con- 
sisted of a small amount of distributed logic embodied in electrome- 
chanical devices but, as technology has permitted, the machines have 
evolved into largely solid state systems with central processor control. 
The environment in which switching machines must operate has also 
changed from a relatively small collection of analog voice grade circuits 
to an overwhelming number of circuits of various bandwidths with an 
increasing proportion of digital facilities. It is the purpose of this paper 
to look at a possible future realization of one portion of a switching 
machine that might have advantages in meeting future requirements 
in a largely digital environment. 

We may consider that a switching machine consists of three major 
subdivisions. One is a switching network, which makes cross connec- 
tions for each call. A controller, used to direct the operation of the 
network, is another. Finally, some interface is needed between the 
network, the controller, and external circuits. The subject of this paper 
is a switching network, one that may reduce the complexity of the tasks 
of the other two subdivisions of a switching machine as well as have 
advantages of its own in conjunction with some emerging technologies. 

991 
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Most switching networks in the past have been “‘space-division” net- 
works; that is, a spatially distinct path is assigned to each of the many 
simultaneous calls that might pass through the network. The present 
paper, however, refers to a ‘‘time-division” network, in which inter- 
leaved samples of several simultaneous calls may share parts of the 
same spatial path. A potential savings of equipment is implied by this 
time-sharing process, and its merits have already resulted in plans for 
a large-scale digital electronic switching machine with some time shar- 
ing in the switching network (the No. 4 ESS, now under development 
in Bell Laboratories).! 

The possibility of constructing a somewhat different time-division 
digital switching network from two basic building blocks is explored in 
this paper. The two blocks, or subsystems, may be realized in the form 
of planar shifting arrays which are basically shift registers that can 
perform shifting operations in two orthogonal directions. These capa- 
bilities seem to be consistent with those of the emerging technologies 
of magnetic bubble and charge-coupled devices,?:? which do shift data 
on a plane. Although these types of devices may be particularly well 
suited for use as planar shifting arrays, the concepts presented below 
are not restricted to implementation by any particular type of device. 

The first building block, a time-slot interchanger, is the only actual 
switching element in the system. The other type of block, a mass 
serial-to-parallel converter, performs a time-space mapping and thereby 
acts as the interconnection links between successive stages of time-slot 
interchangers. Networks of arbitrary size and blocking probability can 
be fashioned from these two building blocks. 

In the sections that follow, the basic interconnections of time-slot 
interchangers and mass serial-to-parallel converters necessary to per- 
form multistage switching functions are explained, and some logical 
details of the two building blocks are presented. Although the network 
is only a part of a total switching machine, the concepts presented here 
may simplify the interface with digital external circuits and may help 
reduce the burden on the central control processor. 


Il. NETWORK ARCHITECTURE 


In this section, multistage switching network structures are described 
that use pure time-division techniques only. Such a network may be 
diagrammed using two types of functional blocks, as in Figure 1. 

Each block labelled TSI denotes one time-slot interchanger—a 
familiar subsystem in the time-division switching art. The attached 
notation N X M indicates that the TSI rearranges words from an 
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Fig. 1—Three-stage switching network. 


N-slot input frame into an M-slot output frame. The TSI is the only 
actual switching element in the network. It corresponds to an N X M 
array of crosspoints in an analogous space-division network. 

Each block labelled S—P denotes a mass serial-to-parallel converter. 
The S—P does not perform switching functions, but corresponds to the 
links between crosspoint stages in a multistage space-division network. 
Its basic function is that of interchanging the space and time co- 
ordinates of each word it handles. As inputs to one S-P, the figure 
shows L time-division multiplexed lines, each carrying M time slots 
per frame. The output is / lines with L slots per frame. An input word 
in slot m of line ¢ will always be routed by the S-P to slot ¢ of output 
line m, where 1 S m S M and 1 S ¢ S L. Thus, the mth words of all 
L input frames are combined to form a single output frame on line m, 
and conversely the input frame on line ¢ gets distributed into the ¢th 
slots of all the M@ output frames. 

A three-stage switching network is diagrammed in Figure 1 using the 
two functional blocks. To illustrate its operation, suppose that a word 
in slot n of input line ¢ must be sent to time slot » on output line A, 
where 1 S ny S Nand 18S 4,\ S L. First an intermediate time slot 
m is assigned for some 1 S m S M, which permits the middle stage to 
complete a connection. Thereafter: 


(1) The ¢th input TSI switches word n to slot m. 
(ii) The first S—P places this word in slot f on line m. 
(117) The mth intermediate TSI switches word ¢ to slot X. 
(tv) The second 8-P places the word in slot m on line i. 
(v) The Ath output TSI switches word m to slot v, and the task is 
finished. 


The portion of the figure surrounded by a dashed line performs the 
same function as the time-shared space-division switch (TSSDS8) stages 
of the No. 4 ESS.! That is, all words that enter the first S—-P in time 
slot m will exit the second S—P in the same slot. In their passage through 
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Fig. 2—Spiderweb graph for Figure 1. 


the S—P’s and the mth intermediate TSI, however, these words may be 
spatially rearranged to different output lines. A direct realization of the 
same TSSDS function using magnetic bubbles has been suggested by 
P. I. Bonyhard.+ 

Figure 2 is the probability linear graph® (also called spiderweb or 
Lee graph) of all paths for one call in Figure 1. It is the standard Lee 
graph for a three-stage network and yields the usual blocking formulas. 
In particular, if 44 = N — 1 and we assume independent occupancy p 
of all input and output time slots, then a modification of C. Y. Lee’s 
argument’ yields the mismatch blocking probability first given by M. 
Karnaugh® in 1954: 


Ps = (p)¥-¥#I1 — (1 ~ p)*P8-¥-9(N = 1)12/ 
M!(2N — M—2)!. (1) 


On the other hand, if 4 < N — 1 and we assume all intermediate 
time slots m have independent occupancy p, a more appropriate esti- 
mate is: 

Pa Gah py (2) 


When M = 2N — 1 in Figure 1, Pz vanishes and the network is non- 
blocking.’ Figure 3 illustrates such a case for M = 2N. A part of the 
time-space interchange is accomplished, not by the S—P, but by split- 
ting each input and output TSI into two separate TSI’s. Now each 
TSI in the network handles input and output frames of equal size. This 
might prove convenient for certain applications or implementations. 
Note that Figure 3 may also be interpreted as a pair of three-stage net- 
works whose inputs and outputs are tied in parallel. A third, inactive, 
network could be placed in parallel as well, and held in reserve against 
failure of one of the first two. 

There is an unavoidable signal delay of at least one frame for each 
stage of the TSI’s. This could contribute to the cost of echo suppression 
on long toll circuits; however, the duration of the frame itself may be 
shortened within the switch to mitigate this effect. 

The three-stage network of Figure 1 has a capacity of C = LN 
terminations. Such networks may be nested to achieve a higher switch- 
ing capacity. A five-stage example having capacity C = JLN is dia- 
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NxN LxL NxN 
Fig. 3—Nonblocking three-stage network. 


grammed in Figure 4. Shown are M three-stage TSSDS sections 
within dashed lines, as well as one large overall TSSDS which re- 
arranges JL words spatially in each of 1 time slots. Blocking probabili- 
ties for the five-stage nested network are obtained by arguments simi- 
lar to those for eqs. (1) and (2), using the spiderweb graph in Figure 4. 

A different kind of five-stage network organization is shown in Figure 
5. No section of this network performs the TSSDS function. In fact, 
a word entering its first S—P in time slot m may exit its last S—P in any 
time slot uw for 1 S m,u S M, depending upon the path it follows 
through the network. The number J/? of possible pairs (m, ») yields 
the same number of possible paths for routing a message, as the spider- 
web graph in Figure 5 shows. At full occupancy, this graph reduces to 





Fig. 4—Nested five-stage network. 
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Fig. 5—Cross-connected five-stage network. 


Figure 2, but with M’ = (M — N + 1)? center-stage switches, so that 
(2) applies with link occupancies p = N(L — 1)/M? to yield the fol- 
lowing blocking probability expression: 


Py = [1 — (1 — p)e]orene (3) 


Thus, parameters L = N = 48 and M = 60 yield Ps < 107?° at any 
occupancy level; essentially, this is a nonblocking network. The same 
parameters in a nested arrangement produce a much higher degree of 
blocking, Pz > 0.027 at full occupancy, based on (1) for the case p = 1. 
To illustrate operation of the network in Figure 5, suppose that a 
word in time slot 7 on line ¢ of input block 7 must be transmitted to slot 
v on line A of output block T for 1 S$ ny S$ N and 1S 4,\ S L with 
1 < j,I s J. First, intermediate time slots m and » must be chosen 
for some 1 S m,u S M which permits completion of the connection. 
Then TSI’s in the five stages perform the following switching opera- 
tions consecutively to complete the task, as indicated on Figure 5: 


(t) Switch word n into slot m (Stage 1). 
(2t) Switch word ¢ into slot u (Stage 2). 
(zit) Switch word 7 into slot T (Stage 3). 
(tv) Switch word m into slot » (Stage 4). 
(v) Switch word uy into slot v (Stage 5). 


The networks of Figures 4 and 5 each have a center section consisting 
of the third-stage TSI’s and the two rows of S—P’s to which they 
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attach. A total of J independent input and output blocks of LN termi- 
nations each connect to the center. Each pair of blocks consists of an 
input S—-P, an output S—P, and their attached TSI’s in the first, second, 
fourth, and fifth stages. This structure immediately suggests an ap- 
propriate strategy for growing the network capacity in modules of LN 
terminations. 


III. NETWORK PATH SEARCH 


Section IJ mentioned the need to choose one or two intermediate 
time slots m before setting up a message path through the network. This 
amounts to specifying which of the many possible paths in the spider- 
web graph is currently free and will be used. The decision might be 
made by a central control processor after consulting its memory records 
of the current status of the network. In this section, an alternate pro- 
cedure is described by which some simple operations within the network 
itself can identify suitable paths for routing a new call. A possible ad- 
vantage is reduced demands upon processor time and memory for call 
processing. 

To provide necessary information about network status, a ‘‘busy- 
bit’’ is used. This is a bit in each word which travels through the net- 
work with that word and serves to indicate whether or not the word is 
part of a call currently in progress. The busy-bit may occur in every 
word of a message or less frequently, perhaps every tenth or hundredth 
frame. For concreteness, assume the busy-bit is ‘‘one”’ for a call in 
progress and ‘‘zero’”’ otherwise. 

To determine occupancy of a time slot on some line in the network, 
we merely consult the busy-bit in that slot. For example, in routing a 
word from input line ¢ of Figure 1 to output line \, an intermediate time 
slot m was chosen that was vacant both at the output of the fth first- 
stage TSI and at the input of the Ath third-stage TSI. Such m may be 
found by comparing the two corresponding streams of busy-bits. This 
is illustrated by Figure 6a for the case ¢ = \ = 1. Output from the first 
input TSI and input to the first output TSI are sent to a NOR gate. 
When simultaneous zeroes are found in the busy-bit positions of words 
m, an output pulse is produced by the gate identifying an available 
path through the network by its timing. 

Other means of sampling and matching busy-bits are possible. For 
instance, the stream of busy-bits from the ¢th input TSI will exit the 
first S-P simultaneously in word time ¢, while the busy-bits entering 
the second S—P at word time 2 are those for the Ath output TSI. 
Hence, timed sampling pulses to the two S—P’s can select the two de- 
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Fig. 6a—Path search in three-stage network. 


sired strings of busy-bits rather than use spatial selection of an input 
and output TSI. Such an option might serve to reduce the amount of 
control and signal wiring associated with the network. 

Once an intermediate time slot m is chosen, a central processor could 
issue the necessary orders to the first-, second-, and third-stage TSI’s 
to establish the path. But again some additional logical apparatus in 
the network could perform the same task. The significance of this ob- 
servation is that the dependence of the network upon external inter- 
vention may be reduced. Indeed, for each new message, the principal 
external control required would be a specification of which input and 
output terminations must be connected. Such a structure might lend 
itself, for example, to applications in which all processor functions are 
performed by a remote computer that sends its instructions to the net- 
work over a data link. 

The path search procedure in the nested five-stage network of Figure 
4 would consist of two nested three-stage path searches. That is, the 
busy-bit streams of the input and output TSI’s are compared, as above, 
to determine which one m of the M intermediate three-stage networks 
will carry the message. Then the mth three-stage network is searched 
for some intermediate time slot 1 S k < K which will complete the 
connection. 

A different search procedure is required for the cross-connected net- 
work of Figure 5, since all M? paths in the spiderweb graph must be 
tested. This involves sorting the M busy-bit streams leaving the second- 
stage TSI’s of the input block 7 by using them as inputs to an S—P with 
M output lines. The S—P outputs are then compared by a set of OR 
gates to the M busy-bit streams entering the fourth-stage TSI’s of the 
output block Tf. A ‘‘zero” at word time m from gate u identifies a pos- 
sible path in Figure 6b. Note that, if time slot m from the first-stage 
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Fig. 6b—Path search in five-stage network. 


TSI or time slot » into the fifth-stage TSI is busy, then path (m, yu) 
cannot be used. To eliminate such possibilities, all the OR gates should 
be pulsed during each busy first-stage slot m, and all gates u correspond- 
ing to busy fifth-stage slots should be pulsed for the entire frame, as 
is accomplished in Figure 6b by feeding parallel ‘‘ones” to the S—P. 
This last path search requires only one pass instead of two nested steps. 


IV. PLANAR SHIFTING ARRAYS 


To understand the role of planar array devices in the realization of 
the functions described above, it is helpful to study the building blocks 
at the logical level, with a notation suggestive of the two-dimensional 
nature of the devices. The device technologies mentioned in Section I 
differ considerably in their physical principles. Nevertheless, certain 
common features of their operation may be abstracted to aid in discus- 
sing their use for switching applications. To this end, a “planar shifting 
array’”’ (PSA) notation will be introduced, exemplified by Figure 7. 
Circles © represent fixed word-storage locations in one-dimensional or 
two-dimensional shift registers. Diamonds < stand for shifting ap- 
paratus, which can move the contents of each circle to the next in line, 
under control of clock pulses A and B. Gates allow transfer of 
individual words between the adjacent circles under control of switch- 
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Fig. 7—Switch elements for time-slot interchanger. 


ing signals n. The distinction between < and () is one of function 
rather than physical structure. In this section, a word, rather than a bit, 
is the basic quantum of information. It is not necessary to specify 
whether this word is in a serial or parallel digital representation or some 
other form. 

A basic scheme for performing switching functions in a time-slot 
interchanger appears in Figure 7. Gates 1 to 6 allow transfer of input 
words from a frame stored in the upper shift register into output slots 
in a frame stored in the lower shift register. The input and output 
frames move, relative to one another, in the two adjacent shift registers, 
and the switching strategy is: For each input word a, wait until its des- 
tined output slot 8 lies directly underneath and then transfer through 
the appropriate gate. 

It is important to note a particular difficulty. If both the input and 
output frames are shifted simultaneously, then word a in the diagram 
will pass slot 6 halfway between gates 3 and 4, so that it is not possible 
to transfer. This we will call the ‘“‘half-word’”’ problem. At least three 
kinds of solution can be suggested: 


(2) Alternate clock pulses A and B, so that only one frame shifts at 
a time. 


OSCOPDOPDOLGO CO 
oS 2 “oO -o @ © 


oy oe 
—ODOSDOSOSOLSO 


Fig. 8—Switch elements for time-slot interchanger with buffer. 
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(zt) Omit one set of clock pulses, say A, so that one frame is held in 
stationary buffers. 

(27) Let each circle contain only a half-word, so that two consecutive 
locations hold an entire word. Then it is never necessary to 
transfer between first and second half-word locations. 


A fourth solution, based on the technique of bubble expansion, has been 
found by P. I. Bonyhard.4 

The first solution above may be illustrated by Figure 7. The basic 
switching cycle would consist of the following four steps: 


(¢) Diamonds <a> move the input words right one slot. 
(77) Gates may transfer input words to output slots. 
(it) Diamonds <#> move the output words left one slot. 
(iv) Gates may transfer input words to output slots. 


For an M-word frame, switching is completed in M cycles. Each shift 
register operates / times, and the WM gates each have as many as 2/7 
chances to operate in every frame. 

The second solution to the half-word problem is illustrated by Figure 
8. The diamonds <a> read in an entire input frame, which is stored in 
buffers Oby simultaneous operation of all gates at the end of the 
input frame. Then the following two-step switching cycle: 


(7) diamonds <c> move the output words left one slot, 
(77) gates may transfer words from buffers to output, 


is repeated 2M times, and the M gates each have M chances to operate 
in an M-word frame. This scheme has the additional advantage that 
the input and output clocks need not be synchronous. Each solution to 
the half-word problem has advantages with respect to particular 
hardware. 

Figure 9 uses the PSA notation to illustrate a simple scheme for 
implementing the S—P function. The S—P shown has L = 4 input lines, 
each carrying six-word frames, and M = 6 output lines, each with 
four time slots per frame. The diamonds <a> operate to load an entire 
frame into the S—P from each input line. Then the diamonds oper- 
ate to unload an entire frame from the S-P onto each output line. 
Loading and unloading may proceed at twice the external basic word 
rate so that the S-P empties in time to receive the next frame, or else 
an alternate S-P may handle the next frame while the first one is un- 
loading. Other schemes permit the S—P to load and unload simulta- 
neously. While the S—P is drawn as a distinct unit in preceding figures, 
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Fig. 9—Mass serial-to-parallel converter. 


only its function need be distinct. The S—P as a device may be broken 
up and its parts integrated onto the same chips as the various TSI’s 
which it serves. Schemes to obviate the S-P completely using specific 
device properties have been proposed by W. F. Chow and P. I. 
Bonyhard.* 


V. MEMORY STRUCTURE 


In the switching schemes described above, each gate should 
operate at the same word times in each consecutive frame; it is natural 
to place it under the control of a recirculating local memory. This could 
consist of a shift register fabricated from the same materials as the 
switching elements and preferably integrated onto the same chips. 
Such an arrangement is diagrammed in Figure 10, using the PSA nota- 
tion. The switching elements appear at the right. The rest of the dia- 
gram is the local memory array to control these elements. Each shift 
register in the control memory moves its contents one word to the right 
under the action of clock pulse C whenever a pulse A or B occurs. A 
word in shift register n is read when it reaches the box , which is 
considered to generate a pulse n that operates gates (>| . For example, 
the gating operations might be realized through repulsion between 
control bubbles and message bubbles. Gates [") at the left start new 
words down the shift registers to accomplish recirculation of the local 
memory. 
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The size of the local memory array depends on the number of words 
Q in each register and the number R of such registers, which is the same 
as the number of switching gates. Although the exact size depends on 
the particular solution chosen for the half-word problem presented 
above, we can conclude in each case that the local memory in a TSI 
grows quadratically in frame size (as QR), while the number of switch- 
ing elements grows only linearly (as R). For moderate-size frames (say, 
20 words or more), each TSI becomes a large recirculating memory 
plane with a small quantity of logic elements at the edges. A word in the 
control memory corresponds most closely to a crosspoint in a space- 
division switch, since the number of crosspoints also grows as the square 
of the number of circuits, and each crosspoint stores just one bit of 
information. 


VI. CONTROL INSERTION AND ERASURE 


The switching operations of a TSI are fixed from frame to frame on 
a short time scale (say, thousands of frames); thus, the contents of its 
local memory array are fixed. On some longer time scale, though, it is 
necessary to be able to change portions of local memory; for instance, 
in response to external signals emanating from a central control 
processor. 

Specifically, one requires the capability to address single word posi- 
tions in the local memory in order to close or open a “crosspoint”? by 
writing or erasing a bit. In the case of erasure, it would be sufficient to 
simultaneously erase all memory locations affecting a given input 
word, or alternatively a given output slot. This is due to the following 
two properties of the TSI: 


(A) Each input word is switched into one output slot at most. 
(B) Each output slot receives at most one input word. 
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Fig. 10—Recirculating memory array for TSI gates. 
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Hence, one can provide for erasure of a control word without knowing 
exactly where it is in the local memory by erasing a whole class of 
words. 

The fact that local memory recirculates will greatly simplify the 
problem of addressing specific locations, since this allows access to any 
location from the edge of the array. Indeed, one efficient and convenient 
scheme would be to build an analog of the switching elements and oper- 
ate it backwards” to insert control bits into the local memory. Inser- 
tion could be directed by two external control pulses whose timing 
would specify an input word and the output slot for which it is in- 
tended. Such an approach is illustrated in Figure 11, in which switch- 
ing elements are deleted to concentrate attention on the memory plane. 
Control insertion elements appear at the right-hand side. A bit is in- 
serted in order to control gate n at word time m as follows: 


(i) At the start of a frame, ©) gates a “one” into the vertical 
shift register, where it propagates downward. 

(27) After n of the A clock pulses, gates operate to place the 
“one” in buffer () at the right of the nth recirculating shift 
register in local memory. 

(217) In the next frame, gates operate with the mth clock pulse 
C, to place the “one” in word m of memory line n. This is indi- 
cated formally by reading the ‘‘one’’ at , but the actual 
details of bit injection will depend on the type of hardware. 


It is clear now that any word location in the local memory may be 
addressed employing a pair of time-coded pulses J and J. The scheme 
shown would be particularly appropriate to the case of buffered input, 
since n then becomes the number of the input word to be switched. 
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Fig. 11—Addressing and erasure schemes for the TSI local memory. 
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Indeed, the nth shift register contains only those locations that can 
affect input word n. By property (A) above, at most one location on this 
memory line has nonzero contents. It is sufficient now to erase the en- 
tire shift register, by interrupting recirculation, for example; before 
a new control word is inserted. 

The left side of Figure 11 shows a simple scheme for erasure in 
the case of buffered input, which also guarantees that at most one 
nonzero word is resident in each memory line. Once each frame, 
the buffers O at the left are loaded with “ones.” Gates then 
start these ‘“‘ones’”’ down the various shift registers at the appropriate 
times to accomplish recirculation. Clearly, at most one nonzero word 
ean circulate. To erase memory line n, the “‘one”’ is simply deleted in 
its recirculation buffer for a single frame. 

A single time-coded pulse will suffice to specify erasure of any word 
in the local memory above, if the buffers are loaded serially. Just such 
an arrangement appears at the left side. Gate enters ‘‘ones’’ in 
the vertical shift register at each input word time, and these are loaded 
into the buffers by gates [2) at the start of each frame. By deleting 
the B pulse at the nth word time, a “‘zero”’ is sent to the buffer for the 
nth memory line, erasing it and opening any “‘crosspoint”’ which might 
affect the nth input word. The deletion of pulse B could coincide with 
pulse J, which routes a new control word to memory line n. Then the 
pair of external signals J and J would accomplish erasure together with 
address insertion. Examples at device level of this procedure have been 
given by P. I. Bonyhard and W. F. Chow.‘ 


VII. BUSY-BITS FOR PATH TAKE-DOWN 


The preceding section considered two schemes for erasure in the local 
memory: 


(¢) A direct instruction from the central control processor. 
(27) Automatic erasure when a new control word is inserted. 


A third scheme is to provide for automatic erasure when the message 
that is being switched terminates. This might be accomplished through 
use of the busy-bits introduced previously. Recirculation of a word in 
local memory would be made contingent on the presence of a “one” in 
the busy-bit position of the particular input word which that control 
word switches. When the message terminates, the path along which it 
was routed through the switch is taken down simply by sending through 
one word with a “zero” busy-bit. This function is analogous to that of 
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the sleeve-lead in electromechanical switches and might be considered 
an ‘electronic sleeve-lead” application of busy-bits. 

This strategy of making recirculation in the local memory contingent 
on busy-bits can be easily implemented in Figure 11. It is merely neces- 
sary to read the busy-bit stream for the input frame into the left-hand 
shift register and load it into the recirculation buffers once each frame. 
Now each busy-bit will prime the recirculation of the memory register 
associated with the word of that busy-bit. 


VIII. CONCLUSION 


The possibility of constructing a time-division switching network 
using two building blocks of planar arrays was discussed in this paper. 
The compatibility of planar arrays with the emerging technologies of 
magnetic bubble, charge-coupled, and bucket-brigade devices?? may 
lead to application of these concepts in the construction of relatively 
inexpensive large-capacity switching machines. 

The need for external connections to a network composed of these 
building blocks can be minimized by including path search and path 
maintenance functions with the blocks. As a result, a relatively small 
amount of information must be exchanged between the network and 
the supervisory processor, and some of the processing burden on the 
controller is shared by the network itself. 

Some examples of compatible network architecture have been given, 
although no specific design is proposed. The process of switching is ac- 
complished by time-slot interchangers within the network rather than 
a space-division crosspoint array, but the task performed by some of 
the described subnetworks, when viewed from their external ports, is 
the same as a time-shared space-division switch. 
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Charge Distribution in Buried-Channel 
Charge-Coupled Devices 


By W. H. KENT 
(Manuscript received January 11, 1973) 


This paper studies charge distribution in buried-channel charge- 
coupled devices. Detailed development of a one-dimensional electrostatic 
model is presented and a numerical solution of the resulting nonlinear po- 
tential equations ts described. Graphical results show the charge-filling 
mechanism and the relationship between the oxide-semiconductor interface 
potential and total free positive charge. 


I. INTRODUCTION 


This paper describes a numerical determination of the distribution 
of charge in a one-dimensional model of a buried-channel charge- 
coupled device (CCD).! Several calculations have recently been made 
of the static potential in CCD’s in the absence of stored charge.?-* In 
addition, the motion of the stored charge under dynamic conditions 
has been studied by means of essentially one-dimensional models which 
do not involve a true knowledge of the distribution of stored charge or 
the charge-carrying capacity of the CCD.?:> However, so far it has not 
been possible to calculate even the static stored-charge distribution in 
a two-dimensional model of a buried-channel CCD, much less to follow 
the motion of this charge under dynamic conditions. In this paper a 
start is made on the problem by calculating the static distribution of 
stored charge in a one-dimensional model of a buried-channel CCD. 
The resulting information on the charge distribution is of interest in 
itself. However, an additional important objective has been to find 
numerical techniques which can be extended to the two-dimensional 


problem. 
The paper is divided into three parts. Section I treats the physics of 


the model and Section III gives numerical results. Section II deals 
briefly with the numerical techniques used and it may be omitted by 
the reader without loss of continuity. 

1009 


1010 THE BELL SYSTEM TECHNICAL JOURNAL, JULY-AUGUST 1973 


7 SILICON DIOXIDE 
¥ 
METAL p—TYPE SILICON n— TYPE SILICON 
0 h, hy x 


Fig. 1—Buried-channel device structure. 


The buried-channel device has a layered structure which will be 
modeled in one dimension as follows (Fig. 1): 


02H h;: silicon dioxide (SiO) with relative dielectric constant 
€1/€o = 4. (eo is capacitivity of free space.) 
hy S x She: p-type silicon with acceptor number density N a(x) 
and relative dielectric constant ¢:/e, = 12. 
he SxS o: n-type silicon uniformly doped with constant donor 
number density Np and dielectric constant «3 = e2. 


The point x = 0 is a perfectly conducting boundary held at a potential 
V.. The potentials ¢1(x), ¢2(x), and ¢3;(2), in the SiOs, p-type, and 
n-type regions, respectively, satisfy the one-dimensional Poisson 
equation, 


d? ila ‘ 

BS) _p(ad/ey 4 = 12,3, (1) 
where the p;(x), the lineal charge densities, are nonlinear functions of 
the potentials ¢;(z). 

The functional forms of the p;(x) are determined by the assumptions: 





(¢) The SiOz is a perfect insulator. 
(21) The generation and recombination rates for holes and electrons 
are zero. 
(zi¢) Hole and electron currents are zero at the time of observation. 
(tv) The flat-band voltage is zero. 
(v) The injected free holes and electrons are separately in 
equilibrium. 


Conditions (2) through (v) define a static device for which the p;(z) are: 
p1(2) = 0, 
p(x) = gL p(x) — Na(z) ], (2) 
p(x) = qLNv + p(x) — n(x) ], 


where n(x) and p(x) are the number densities of free electrons and 
holes, respectively. The most general expression for n(x) is 


n(z) =f (BDF (HAE, (3) 
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where g(£) is the density of states, F.(H) is the Fermi distribution for 
electrons, and E(x) is the conduction band edge. Substitution for g(Z) 
and F’.(E£) yields 


- (EZ — E.(x))*dE 


nhs zee) 1 + exp (EZ — E,)/kT) o 


This specific choice for the density of states corresponds to the simplest 
possible band structure. HE’; is the equilibrium Fermi level for electrons 
and N32 is the constant 

Ne = 4x(2m./h?*)}, (5) 


where m, is the effective mass of an electron in silicon and h is Planck’s 
constant. ® 
A similar expression for the distribution of holes is 


ple) = "gy PACBDAE, (6) 


where F(Z) = 1 — F.(E); substitution for g(E) and F;(e) yields 


ee (E(x) — E)*dE 
iD aaa sf _. T+exp (Ep — BE)/kT) (7) 


E,(x) is the valence band edge and E,, is defined as the pseudo-Fermi 
level’ for the holes. NV is the constant given by 


Ni = 4x (2m,/h?) 4, (8) 


where m, is the effective hole mass in silicon.* H,(a) and E(x) are func- 
tions of ¢:(x) given by: 


E(x) = Et — q¢x(), (9) 
E(x) = Ey — g¢i(z). (10) 


E and E? are the valence and conduction band edges at x = o, and 
g is the magnitude of an electronic charge. p(x) and n(x) are functions 
of the yet undetermined constants LH; and Ey,. Later they will appear 
only in the forms 

n = (Ee — E,)/kT, 

n = (E% — Eyzn)/kT. 


Since only difference terms appear, no energy reference need be estab- 
lished for the model. 7 will be determined from the electron charge 
neutrality condition at x = ©. 7’ is fixed by the total amount of free 
positive charge Q. = fn,° p(x)dx in the device. In reality, determina- 
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tion of 7’ is difficult, so the alternative scheme of choosing 7’ and cal- 
culating the resulting Q, is employed. Substituting eqs. (2), (4), and (7) 
into eq. (1) results in the system of differential equations which charac- 
terize the device described above. In the appendix these equations are 
reduced in a straightforward manner by using the Boltzman approxi- 
mation; justification for its use is given. The resulting equations which 
will be solved are (’ denotes d/dy): 


vily) = 0, OSyShi/, (11) 
va(y) = o — (m,/m,)3ere-¥2), hi/\ Sy She/r, (12) 
v3(y) =e — 1 — (m,/m,)¥ere™, — he/N Sy Sw, (18) 
where 
¥ = 9¢/kT, 
y =2/r, 


\ = VkTe2/q2N p, 
N, = N&kT)}, 
N, = N&kT)}, 
€, = the bandgap energy in k7’s, 
n = (E? — E;)/kT is a constant dependent 
on doping levels, 
n = (E% — E;s)/KT is a parameter depend- 
ing on the pseudo-Fermi level, 


Po = 4+ 1 — €, 
o = Na/Np. 


m, and m, are the effective masses of holes and electrons in silicon; 
they are 1.08m, and 0.59 m,, respectively. Equations (11), (12), and 
(13) satisfy boundary conditions 


¥i(0) = Vo (14) 
and, consistent with eqs. (9) and (10), 
y3(o) = 0. (15) 
At y = hi/\ and y = h2/d the continuity conditions are 
Wa(Ius/2) = Wal ha/r) (16) 
eWi(hi/dA) = €2ho(hi/A) 
W2(h2/A) = W3(he/r) (17) 


v2(he/ d) = v3(he/ d). 
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II. NUMERICAL SOLUTION 


The system of differential equations (11), (12), and (13) together 
with boundary equations (14), (15), (16), and (17) are solved numeri- 
cally by the method of finite differences. 

For computational convenience the length of the device will be 
truncated from the whole half-line to the segment [0, L ]; the distance 
L is chosen such that |¥3(©) — ¥3(Z)| and |y3(0) — ¥3(ZL)| are 
suitably small. [0, L] is partitioned by a mesh of N points and the 
solution at each point y; is denoted by y*. The mesh lengths (distance 
between successive points) are 6,, 6p, and 6, in the oxide, p-region, and 
n-region, respectively, and are chosen such that yy, = hi/\ and 
Yn, = h2/X. The boundaries are ¥? = ¥i(0) = V, and yp” = y,(L) 
~ W3(0) = 0. The subscripts on ¥(y) can now be dropped since the 
superscripts identify the solution point. The second derivative is ap- 
proximated by the second difference 


ae = (yt — yi + pit) /5? 


in each region; 6 is one of 6., 6x, or 6, aS appropriate. Using this ap- 
proximation to discretize eqs. (11), (12), and (13) results in the matrix 
equation 


(ALY) = Loy, ¥) J (18) 


which has rows generated by eq. (19); p(y:, ¥*) is defined in the three 
regions by the right-hand side of eq. (19). 


0, t~<M,, 
yt _ 2y' + yr = 1%, = (m »/Me )dere¥*) 5p, N, << No, (19) 
(ev — 1 — (m,/m,)8e°°—“) bn, t > No. 
Rows corresponding to the solutions at the boundary points yy, and 
yn, are obtained from eqs. (16) and (17) where the first derivative is 
approximated by 


FYy'(y) = (+11p(y) — 18¥(y + 4) 
+ 9(y + 26) — 2b(y + 36))/65, (20) 


and 6 is the appropriate mesh size for each region (the positive or for- 
ward derivative uses points to the left of y). Equation (20) results from 
simultaneous solution of the Taylor expansions for ¥(y + 6), ¥(y + 25), 
and y(y -+ 36) with terms 0(6*) and higher dropped; the result is ac- 
curate to 0(6%) which is consistent with the second difference approxi- 
mation to a*y/dy?. In the oxide layer, a first-difference approximation 


W(x) = (py — or) /5_ (21) 
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is adequate since the solution in this region is linear. The boundary 
equations are: 


fort = N,, 


— (66 p€1/5o€2) WRI! “+ (11 + 66 pe1/6 0€2)Y™! 
— 18y "iti 4 gyxit2 — Qy™r+3 = 0; (22) 
fort = Na, 
(1/6 p)(Qp%2-3 — OyNe-? + 18Ne-t) + 11y%9(1/6, + 1/55) 
— (1/6,)(18p%2F! — OyN2t? + QyN2t8) = 0. (23) 
The matrix A in eq. (18) is tridiagonal except for the Nth and Neth rows. 
For each choice of the parameter p., the {y*} of eq. (18) are solved for 


iteratively using a combination of successive under- and over-relaxa- 
tion!® (SOR) on the equation 


[¥*] = LAT Lous, ¥*) 1 (24) 


The procedure described here differs slightly from the usual SOR in 
that the transformed equation (24) is solved instead of eq. (18). For 
the ith row of eq. (24), the 7 + 1th estimate of y* is given as 


Vir = i + o(f' — Y), (25) 


where w is a relaxation parameter with values 0 < w S 2. fis a New- 
ton’s method solution of eq. (26), a transcendental equation in y‘ result- 
ing from the 7th row of eq. (24) with the remaining N — 2 variables y* 
held constant. The coefficients ag! in eq. (26) are elements of A~}. 


i-1 N71 
vi — ¥ aate(yn Ys) — aF'o(yn ¥) — X ae'o(yy vi) = 0. (26) 


Criteria for convergence of the process, as well as the choice of the 
value of w, is based on the residual 7; defined at the 7th iteration as 


Neti so 
gS a lv = ¥j-1|. (27) 


It can be shown that if (18) were a linear system of equations and if y* 
were the true solution at y; then 


sup |¥* — W4il/ sup [¥*| S C()rin, (28) 


1SiSN 1SiSN 


where Cw) is a constant dependent on the choice of w.!! For the opti- 
mum value of w, C(w) is O(N) while for values of w only slightly different 
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from the optimum C(w) can be 0(N?). In the computations presented 
in the next section, N S 100 so the iterative process was stopped when 
the residual terms were less than 10-8; this allowed margin for the fact 
that (18) is nonlinear. A discussion of the choice of w is necessarily even 
more heuristic. It was found that for a mesh of N = 50 points and large 
negative values of p, corresponding to small positive charge densities 
(i.e., e°°-¥* < 1 for all 2), the iterative scheme was convergent for any 
choice of w and any initial estimate of the {y*}. For less-negative values 
of p, and the corresponding larger values of e*°-“, the scheme was con- 
vergent for over-relaxation (w > 1) only if r; < 10 for all 7 (approxi- 
mate figure); but using under-relaxation the process was well behaved 
with 7,’s as great as 10%. It should be pointed out that due to the ex- 
ponential nature of the right-hand side of eq. (19) even a very good initial 
estimate typically resulted in 7; > 10% when over-relaxation was ap- 
plied. Although the SOR process described above may well be stable?” 
regardless of 7; values, the magnitude of the exponential terms in (19) 
limits machine computations. The combination of under- and over- 
relaxation detailed below eliminates this problem. 

Good initial estimates for the {y*} were obtained by choosing the p, 
values with equal spacing Ap; the first p, value being the small positive 
charge-density case described above. With Ap < 10 a linear estimate 
of the {y*} for successive values of p, was adequate. The iterations were 
under-relaxed (w = 0.5) until the residual term was less than 10; suc- 
cessive iterations were over-relaxed so as to increase the rate of con- 
vergence. The choice of w for the SOR steps was not critical; values in 
the range 1 < » S$ 1.5 had approximately the same rate of conver- 
gence. Values above 1.5 did not converge for all values of p.. 

Total positive charge in the device for a given value of p, is calcu- 
lated by 


YN» L 
Q+ =/ ero valudy +f ero ¥3 (wy, (29) 
YN, YNg 
Using a piecewise linear approximation for ¥(y) on [yx, yz+1], 


vy) =o + ((y — yx)/d) (Yt — p*), (30) 


and summing the integrals over each such interval in [yy,, L] gives the 
approximation 


No-l1 
Qs Sdn 2 erelenvs — ee) / — ¥) 


bn Oreo — eH /(yHH — yi), (31) 


1=N2 
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which can be modified to include the case y*t! = y*. Dimensional 
charge in coulombs can be calculated from Qi by Q = Q,N pi. 
The operational scheme may be summarized as follows: 


(t) Choose a value of p. corresponding to small total charge Q.. 
(po > — © in eq. (24) = Q; — 0), 
(i7) Solve for ¥(y), thus determining p(y), 
(i772) Integrate p(y) to find Q,, 
(tv) Increment p, by Ap and repeat (77), (27), and (2). 


The technique of starting with Q, ~ 0 and slowly adding positive 
charge to the device is important since it is this scheme of operation, 
in conjunction with the successive under- and over-relaxation, that 
avoids the exponential overflow limitations in machine computation. 


II. COMPUTATIONAL RESULTS 


In this section, numerical results for two specific device configura- 
tions will be given. The first device has a constant doping profile 
(N« = a constant) and dimensions 


hy/d = 0.48, 
ho/d = 12.48, 
L = 42.48. 
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Fig. 2—Potential versus distance for a buried-channel device with uniformly 
doped p-region and V, = —4 volts. 
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Fig. 3—Charge density versus distance for a buried-channel device with uniformly 
doped p-region. 


\ in this case is approximately 0.415 um. Doping levels are 
Na = 2 X 10% cm™, 
Np = 1 X 10% cm, 
sO 
C= Na/Nob = 20. 
N, is calculated by?® 


N. = 4.831 X 1015(m./m,)?T?, 


with 7 = 300°K and m, as before. 
Using eq. (46), 7 is found to be 


7 = 12.09. 
The nondegeneracy condition for this device may now be stated using 


eq. (45) 
€g ~ n! = v(y) = 3.5, 
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Fig. 4—Potential at the Si-SiO2 interface versus total positive charge in a buried- 
channel device with a uniform p-region and V, = —4 volts. 
or 


n’ — & — Wy) < —3.5. 
Adding 7 to both sides of the inequality gives 
po — WY) <4 — 3.5. 
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Fig. 5—Potential versus distance for a buried-channel device with a Gaussian 
p-region doping profile and V, = —4 volts. 
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Fig. 6—Charge density versus distance for a buried-channel device with a Gaussian 
doping profile. 


Then for each choice of the parameter p., the solution set {y*} must 
satisfy: 
Po — y? < 8.59. (32) 


From eq. (12) it is clear that if one assumes the Boltzmann approxi- 
mation to hold, then as long as Q+ < o(he — hi)/\ the equilibrium con- 
dition causes the lineal charge density to have constant sign so 


po — ¥* < log a(m./m,)# (33) 


for all z; for this device, eqs. (32) and (33) are always consistent if 
Na < 1.8 X 1038 cm=3. 

Computation was performed using a 90-point mesh and took ap- 
proximately 4.8 minutes of processor time on a Honeywell 6000-series 
machine. 

Figures 2, 3, and 4 summarize the computational results. Figure 2 
shows potential solutions {7} versus the points y; for a family of p, 
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Fig. 7—Potential at the Si-SiOz interface versus total positive charge in a buried- 
channel device with a Gaussian p-region doping profile and V, = —4 volts. 


values with V, = —4 volts. The range of total charge values is indi- 
cated. Figure 3 is a plot of the linear charge density with Q, values 
indicated. Figure 4 is Q; plotted against y%! (the oxide-interface 
potential). 

A realistic modification to the device studied thus far is to allow o to 
have y variation. The second set of results presented are for the same 
device as described above but with the p-region having a doping profile 


o(y) = (Dy + Lent hv! (a2-hy)}? In (Dot) — 2, (34) 


The values for hi, he, and V, are as before, L = 32.48, and D, = 46. 
[ This value of D, corresponds to an average doping level in the p-layer 
of N4 = 1.9 X 1015/cm, and eq. (34) describes a doping profile as if 
the p-layer were formed by drive-in diffusion.'4] The solutions {y*} 
must still satisfy eq. (32). Figures 5, 6, and 7 are a summary of results. 

Comparison of the two cases shows that for the doping profile in eq. 
(34) the potential minimum is greater and the “channel” is shifted 
toward the oxide. In both cases the added positive charge is contained 
entirely in the p-region and it fills the region starting from the side 
remote from the oxide interface (see Figs. 2 and 5). 
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APPENDIX 
Substituting eqs. (2), (4), and (7) into eq. (1) results in (’ denotes 
d/dzx): 
¢i(z)=0, OSa4Shy, (35) 
px ae ee a (E(x) — E)*dE 
$2 (x) fe €2 {W. Nef 1 + exp [(E'sn = E)/kT ] ’ 
hy ie re ho, (36) 
i q s (EF — E,(x))idE 
=-+ N° —_—__.___*.___. — N 
ie) = Line” crept: Barer] ~ 
iif CO ae 
‘J-0 1+ exp((Byp — £)/kT J’ 
heSxs om. (87) 
Transform the integral involving E(x) by making the substitutions: 
e = (EF — E,(z))/kT, 
E.(a) = Et — q¢e(z), 
n = (BE — Ey)/kT. 
For the integrals involving L(x) make the substitutions: 
e = (E,(z) — E)/kT, 
E(x) = ES — ge(a), 
q = (Ee — Eps) /kT, 
ég = (Ee — Et) /kT. 
¥(x) = qo(x)/kT 
and make the change of variables 
y = a/h, 


\= VKT €2/q?N pv. 


These substitutions and some straightforward manipulation reduce 
eqs. (385), (86), and (37) to 
vey) =0, OS yShi/, (38) 
” - Na - N° ; | 2 etde 
W2(y) == Nop No (kT) ‘ 1 + exp (e + ég — 7! + v2(y)) ’ 
hi/\ Sy She/d, (89) 


For both cases let 


where 
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ees sf? etde - 
vi) = HE aD'{ Stet val - 
N? %0 etde 
_ No Be Oe oe Se 
vp &P) o 1 +exp (e+e, — n' + ¥s(y))’ 


hoe/A Sy S ~. (40) 


These equations are further simplified by making the classical Boltz- 
mann approximation for the electrons: 


% etde safe etde 
lomecnpowp2) ae 1! 


which holds since » — ¥(y) is always positive by several k7’ (see Fig. 
8). The last result is integrable and simplifies the approximation to 


P(g) exp (¥(y) — 1), (42) 


where I'(-) is the usual gamma function. This approximation has been 
shown to be less than one percent in error if! 


n— ¥(y) > 3.5. (43) 
Similarly, for the holes, 


-. etde Mectelie ee 
iB Ltepet.,—7 + vay T'(3) exp (n’ — eg — ¥(y)) (44) 
if 
ég— 1 + Hy) > 3.5. (45) 


The validity of this last approximation is not clear since ¥(y) is de- 
pendent on 7’ and ¥(y) becomes large and negative. Note that requiring 
the inequalities (43) and (45) to hold is equivalent to requiring the 
device to be nondegenerate. Computational results show this approxi- 
mation to be consistent. 

n is expressed in terms of the constants N,, N,, and Vp by requiring 
electron charge neutrality at x = o. It should be stated that the cor- 
rect condition to use at this point is complete charge neutrality. This 
requirement, however, is computationally indistinguishable from the 
algebraically simpler condition used here. Using eqs. (40) and (15), the 
neutrality condition is 


(N./N v)e"T' (3) — 1 = 0, 
sO 


N./Np = e"/T(3). (46) 
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Fig. 8—Band diagram for buried-channel device. 


Equation (46) defines the Fermi level for the electrons. Using eqs. (5) 
and (8), 


N./Np = (N./N.)-(Ne/Nb) = (m./m-)¥e7/T (3), (47) 


where m, and m, are the effective masses of holes and electrons in 
silicon; they are 1.08 m, and 0.59 m,, respectively.1° Using eqs. (42), 
(44), (46), and (47) in eqs. (38), (39), and (40) results in equations 
(11), (12), and (13) which are to be solved. 
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The Effect of Rain on Circular Polarization at 18 GHz 


By R. A. SEMPLAK 
(Manuscript received February 14, 1973) 


Limitations imposed by attenuation during heavy rain on the reli- 
ability of microwave systems are well known,! and some calculations 
of the depolarization of linearly? and circularly* polarized waves have 
been made. Recently, measurements of rain-induced rotation‘ of linear 
polarization at 30 GHz indicated that depolarization by large oblate 
raindrops® may limit efficient utilization of a microwave channel where 
orthogonal polarizations are employed. However, an advantage in 
using circular polarization is that less-stringent mechanical stability 
may be required of some antennas; also, in satellite systems, circular 
polarization is not affected by Faraday rotation. As the effect of rain 
on transmission of circular polarization had not been measured, an 
experiment was initiated. Data have been collected for the period June 
1, 1972, through January 24, 1973. 

A frequency-swept Gunn oscillator operating at a frequency of 18.5 
GHz is used as a source in circular polarization on a 2.6-km path ori- 
ented in a southeasterly direction from Crawford Hill, Holmdel, New 
Jersey, the site of the receiver. The receiver has a ferrite switch which 
looks sequentially at the received fields, i.e., the desired circular polari- 
zation and then the depolarized component, the opposite sense, are 
observed.* Paper strip chart recordings are made of both the desired 
and depolarized components. 

The clear-day polarization discrimination of the system is better 
than 32 dB. In view of the ambiguities associated with measuring cross 
polarization below —382 dB, none of the data below —32 dB are 
included. 

The total data obtained from the circularly polarized system are 
shown in Fig. 1 where the depolarized component (the field measured in 
the undesired sense) is plotted as a function of the rain-induced at- 
tenuation. One can see that rain has a strong depolarizing effect. For 
example, the very deep rain-induced attenuations of 39-40 dB have 


* The switching rate is 17 Hz; this is much faster than the changes in attenuation 
produced by rain. 
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Fig. 1—Data on circular polarization from June 1, 1972, through January 24, 1973. 
Depolarized component plotted as a function of rain-induced attenuation. 


depolarized components that are only 9-15 dB below that level. The 
curve shown in Fig. 1 is an estimate of the median of the data. From the 
figure, we observe, for example, that a rain-induced fade of 20 dB has 
associated with it a depolarized component with a median value of the 
order of —15 dB. Likewise, for a modest rain-induced fade of 10 dB, 
the median value of the depolarized component is about 23 dB below 
that level. 

From measured data at 18 GHz it is concluded that there are serious 
polarization discrimination problems for circular polarization during 
periods of rain. However, circular depolarization should not be as 
serious for frequencies of 60 GHz and higher, since at these frequencies 
the small raindrops have the strongest effect on transmission, and these 
small drops tend to be spherical rather than oblate. Measurements not 
discussed here show by comparison that the attenuation of circularly 
polarized waves by rain lies between that for horizontally and vertically 
polarized waves.’ 
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Attenuation Through the Clear Atmosphere at 30, 19, 
and 13 GHz for Low Elevation Angles 


By PAUL S. HENRY 
(Manuscript received February 21, 1973) 


I, INTRODUCTION 


Synchronous satellite service for Alaska, and possibly other places, 
requires that ground station antennas point at low elevation angles. 
For example, from Point Barrow (71°N, 155° W), such a satellite would 
never be more than 11 degrees above the horizon, and for satellite 
longitudes 45 degrees east or west of Point Barrow, the elevation is only 
5 degrees. At such low angles, the attenuation of a nominally clear 
atmosphere is significant in the 18- and 30-GHz bands of proposed 
domestic satellite systems. There are also satellite bands near 138 GHz, 
where the attenuation is expected to be somewhat lower. Predictions 
of this attenuation have been made,! but as a check direct measure- 
ments have been obtained with the Crawford Hill Sun Tracker as re- 
ported below. 


II. APPARATUS AND PROCEDURE 


The experimental setup, described in detail elsewhere,’ is briefly 
this: the antenna temperatures of the sun and a nearby patch of sky 
are compared by means of a radiometer. The temperature difference, 
AT, is related to the excess attenuation above the solar noon value, 
A, by the formula 

A(dB) = 10 log [AT/AT»], (1) 
where AZ) is the antenna temperature difference at solar noon on a 
clear day. As the antenna follows the setting sun, attenuation as a 
function of elevation angle is measured directly. 
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The measured attenuations are increases above the noontime value. 
Thus the absolute attenuation is the measured value plus the atmo- 
spheric attenuation at noon; the latter has been estimated to be 0.3, 
0.3, and 0.1 dB at 30, 19, and 13 GHz.! For the remainder of this paper, 
quoted attenuations are referred to noontime as the zero of attenuation. 

A small correction must be applied to the Sun Tracker raw data to 
account for different antenna temperatures in the “‘sun” and “sky” 
positions due to differences in atmospheric radiation. This correction 
is readily determined by measuring AT for low elevation angles of the 
antenna beam, while the sun is high in the sky and thus out of the 
beam. The magnitude of the correction is less than 3 percent of the 
measured attenuation in dB for elevations of 5 degrees or greater. 
Below 5 degrees, the correction rises to a maximum of 8 percent at 
3 degrees elevation. 


III. DATA 


The measured attenuations, corrected as described above, are shown 
as a function of elevation angle in Fig. 1. The data, collected during 
four sunsets in August 1972, are shown as bars spanning the full range 
of the values observed. During the measurements the absolute surface 
humidity was about 12.5 gm/m', which is typical of summertime New 
Jersey. The curves through the bars represent the average behavior 
of the data. At 5-degree and 10-degree elevations are indicated the 
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Fig. 1—Attenuation vs elevation angle, normal summer weather (average hu- 
midity 12.5 gm/m‘), 
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Fig. 2—Attenuation vs elevation angle, humid weather (August 7, 1972, humidity 
23 gm/m’). 
attenuation predictions of Hogg, normalized to zero dB at noon. Also 
shown for the 30-GHz attenuation are the points corresponding to a 
cosec 6 law, normalized to agree with observation at 10 degrees and 
12 degrees. 

Figure 2 is a plot of measured attenuation versus elevation for very 
humid air. These measurements were made on August 7, 1972, shortly 
after a rainstorm when the atmosphere had cleared. The surface hu- 
midity was 23 gm/m? at the time of observation. For comparison, the 
curves from Fig. 1 also are shown. The increased attenuation is clearly 
visible. Even larger attenuations are possible. Hogg and Semplak’ have 
calculated that very humid weather can result in attenuations more 
than double those shown in Fig. 2. 


IV. EXTRAPOLATION TO OTHER ATMOSPHERIC CONDITIONS 


The atmospheric attenuation is due primarily to oxygen and water 
vapor. In summertime New Jersey, the fraction of the total attenua- 
tion attributable to water vapor is roughly 50, 75, and 50 percent at 
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30, 19, and 13 GHz, respectively.! Thus one can convert attenuations 
measured at one humidity to values corresponding to a different water 
vapor content. For example, we can predict the attenuations that 
should have been observed under the conditions of Fig. 2 (23 gm/m*) 
by doubling (very nearly) the contributions due to water vapor in 
Fig. 1 (12.5 gm/m*). This simple extrapolation rule predicts the ob- 
served attenuation to an accuracy of 10 to 15 percent. 

Compared with New Jersey, the Point Barrow atmosphere contains 
essentially no water. In winter the humidity is 0.5 gm/m* and in 
summer 5 gm/m‘*.4 Therefore, a first approximation for conversion of 
the attenuations of Fig. 1 to Alaskan conditions consists in neglecting 
the water vapor entirely and simply reducing the observed attenua- 
tions by 50, 75, and 50 percent at 30, 19, and 13 GHz. 


Vv. ERRORS 


There are two main sources of systematic error. The first is occul- 
tation of the setting sun by objects near the horizon. At the Crawford 
Hill location the Sun Tracker has a clear view down to about 3 degrees 
elevation. The second error is an increase in antenna beamwidth in the 
vertical plane due to the gradient of atmospheric refraction within the 
antenna beam.’ The forward gain of the antenna is thus reduced, re- 
sulting in an apparent increase in attenuation. The magnitude of this 
effect depends on the size of the source being observed. A ‘‘point’’ 
source, such as a synchronous satellite, would show an attenuation 
about 0.1 dB above the values reported here. 

Strictly speaking, the points calculated by Hogg shown in Fig. 1 
should not be compared directly with the data. Although the attenua- 
tion at solar noon has been subtracted from them (they are normalized 
to 0 dB at noon), they still do not correspond to the conditions of this 
experiment. Hogg assumed a humidity of 10 gm/m?—a value 20 percent 
below the prevailing humidity during the observations. A rough 
correction to Hogg’s values would involve scaling his attenuations 
(in dB) up by 10, 15, and 10 percent at 30, 19, and 13 GHz. 

Fluctuations in the data are attributable to three sources: (7) error 
in reading attenuations on the chart recorder, (7) error in reading the 
time on the chart, and (277) changes in humidity over the course of the 
observations. The first-mentioned error, called 6A below, is estimated 
to be about 0.2 dB rms, one-fifth of the smallest chart division. The 
timing error is significant because it leads to an uncertainty in the 
elevation angle, 6@ below, at which a particular measurement was made. 
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A reasonable estimate for this error is 1 minute rms, which means 40 is 
0.25 degree. Finally, the diurnal changes in humidity, a, were about 
20 percent. All these errors add in quadrature to give a resultant over- 
all error estimate 


EP = (6A)? + S?(88)? + (faA)?, (2) 


where S is the slope of the attenuation versus elevation angle curve 
(see Fig. 1), f is the fraction of the attenuation due to water vapor, 
and A is the measured attenuation. The errors represented by the 
three terms of eq. (2) are of comparable magnitude. Thus the noise in 
the data must be ascribed to both instrumental and “external” sources. 

The rms fluctuations predicted by eq. (2) can be compared with the 
observed scatter in the measurements at various frequencies and 
angles. Typically the two differ by only 0.2 to 0.3 dB, indicating that 
the stochastic processes operating in this experiment are reasonably 
well understood. 


VI. CONCLUSIONS 


The 13-, 19-, and 30-GHz attenuation of a clear atmosphere at low 
elevation angles has been measured by the Crawford Hill Sun Tracker. 
The results are in good agreement with predictions. Extrapolation of 
the measurements to Alaskan conditions yields attenuations sub- 
stantially below those measured in New Jersey. A significant improve- 
ment in the measurements using the Crawford Hill Sun Tracker can 
be made only under conditions of reduced and/or more stable atmo- 
spheric water vapor content. 
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